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Human Chitinase Alpha and Chitinase Alpha-2 
Background of the Invention 
Field of the Invention 

The present invention relates to novel human chitinase genes - chitinase 
alpha and chitinase alpha-2. More particularly, the invention relates to nucleic 
acid sequences encoding mature chitinase alpha or mature chitinase alpha-2 or 
fragments thereof, the use of such polypeptides for treating disease, as well as 
processes for producing these polypeptides recombinantly. 

RelatedArt 

The generation or destruction of tissue requires constant reorganization 
and restructuring of the extracellular matrix (ECM) components, including 
interstitial coUagens, basement membrane collagen, fibronectin, laminin, aggrecan, 
and various proteoglycans. Heinegard and Oldberg, FASEB J. 3:2042-2051 
(1989). Normal types of remodeling processes include embryonic development, 
post-partum involution of the uterus, ovulation, wound healing, and bone and 
growth plate remodeling. Woessner et al, Steroids 54\ 491-499 (1989); Weeks 
et al, Biochim, Biophys, Acta 445:205-214 (1976); Lepage and Cache, EMSd 
J. 9:3003-3012 (1990); Wride and Sanders. Dev-Dyn, /95(3):225-239 (1993), 
Similar processes also occur in disease states such as joint destruction in 
rfieumatoid and osteoarthritis, periodontia, and tumor cell metastasis. Thompson 
and Oegema, J, Bom Joint Surg. 67:407-416 (1979); Reynolds et ai, Adv-Dent- 
Res. S(2):312-319 (1994). One example of these processes is the migration of 
macrophages to the site of inflammation as in the case of synovial tissue in 
rheumatoid arthritis. Cutolo et al, Clin. andExper Rheum. 11:331-339 (1993). 
The ECM components are regulated, in both normal and disease states, by various 
exogenous and endogenous factors. For example, in tumor formation, the 
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differentiation state of the cell can increase the rate of degradation of the ECM. 
Benya, Pathol Immumpathoi Res. 7:51-54 (1988). Likewise, the presence of 
metalloproteinases or their inhibitors can alter the composition of the ECM. An 
imbalance of metalloproteinases and tissue inhibitors of matrix metalloproteinases 
(TIMP) has been shown to contribute to the pathogenesis of osteoarthritis. Dean 
et ai, 1 Clin Invest 5^:678-685 (1989). Cytokines, growth factors, and the 
extracellular environment can all contribute to the alteration of the ECM. Tyler, 
Biochem 1 227M9'%1% (1985); Dinarello, Sem, Immunol ^: 133-145 (1992); 
McConnelle/a/.,y. CellBiol /05:1087-1098-(1987). 

The growth of cartilage and bone is actualized by cells such as articular 
chondrocytes and osteoblasts. The main function of these cells in immature tissue 
is the deposition and remodeling of the cartilage or bone matrix. In adult tissue, 
these cells maintain this matrix in order to ensure its proper function. In both 
case, this «icompasses secretion of proteins involved in the turnover of the ECM. 

A major species of protein secreted by these cells and involved in the 
turnover of the ECM are the metalloproteinases. Woessner, FASEB J. J:214- 
2154 (1991). As will be discussed below, new types of secretory glycoproteins 
have also been identified in human cartilage, osteoblasts, synovial cells, sheep and 
bovine oviduct and mammary cells, and macrophages. Nyrikos and Golds, 
Biochem. X 2tf<?:265-268 (1990); Hakala eial, I Biol Chem.268(34):2SS03- 
25810 (1993); Johansen et al, / Bone and Min Res, 7(5):501-511 (1992); 
Rejman and Huriey, Biochem, Biophys. Res. Commm. 750:329-334 (1988); 
DeSouza and Murray, Endocrinology ;J6(6):2485-2496 (1995); Hollak et al, 
1 Clin. Invest. 9J: 1288-1292 (1995); Arias etal, Biol of Reproduction J/:685- 
694 (1994). These proteins all share regions of significant homology to bacterial 
and fiangal chitinases, and therefore are referred to herein as "chitinase-like 
proteins." They bear a subtle similarity to lysozymes from mammals and function 
as cndoglycosidases, with a specificity for N-acetyl-glucosamine linkages. 
However, these types of chitin-like structures (homopolymers of N-acetyl- 
glucosamine) were not normally encountered in mammalian tissue. 



wo 98/Q6S59 rc i/i;»yo/i juuj 



The human cartilage glycoprotein, HC gp-39, is a mammalian chitinase- 
like protein with an apparent molecular weight of approximately 39 kDa that is 
secreted by both articular chondrocytes and synovial fibroblasts. Nyrikos and 
Golds, supra, Hakala et aL, supra. This protein has been described as a marker 
5 for joint injury, appearing in the blood and synovial fluid from patients diagnosed 

with rheumatoid arthritis. Johansen ei al., British J. of Rheumatology J2:949-955 
(1993). The gene encoding this protein has been cloned. Hakala et aL, supra. 
Neither the protein nor ntflNA for HC gp-39 was detectable in normal newborn 
or adult human articular cartilage obtained at surgery, but was detectable both in 
10 synovial specimens and in cartilage obtained from patients with rheumatoid 

arthritis. Hakala et a/., supra. The observations of Hakala et aL suggest that the 
expression of HC-gp-39 may be related to a response of these cells to an altered 
tissue environment. 

The protein YKL-40 has also been identified as one of the major secretory 
1 5 products of cultured human osteoblastic cells (osteocarcinoma cell line MG-63) 

expressed in response to 1,25-dihydroxyvitamin D3 stimulation. Johansen e/ a/., 
J. Bone and Min. Res. 7(5):501-511 (1992); Johansen et aL, British J. of 
Rheumatology J2:949-955 (1993), The N-terminal portion of YKL-40 was 
sequenced and found to be identical to HC gp-39. Upon further sequencing, 
20 YKL^O and HC gp-39 were found to be identical. 

Chitotriosidase is an enzyme that has been identified as a member of the 
chitinase protein family. Renkema et aL, J. BioL Chem. 270:2198-2202 (1995); 
Hollak et aL, / Clin, Invest, 93:1288-1292 (1994). This protein also has an 
apparent molecular weight of 39 kDa and shares N-terminal homologies with HC 
25 gp-39, the bovine mammary protein, and several bacterial chitinases. Activity of 

this enzyme was originally detected fi-om cells of patients afflicted with Gaucher 
Disease (GD). Gaucher Disease is an inherited deficiency in the activity of 
glucocerebrosidase, lysosomal hydrolase. The defect results in an accumulation 
of glucosylceramide (glucocerebroside) in the lysosomes of macrophages. 
30 Accumulation of lipid-laden macrophages results in hepatosplenomegaly, bone 

lesions, and neurological anomalies. After morphological differentiation of 



monocytes in the macrophages in culture, the cells begin to produce secrete 
increasing amounts of chitotriosidase. This increase was, on average, 600 times 
greater in symptomatic GD patients than in healthy volunteers. Hollak et al, 
supra. The elevation in chitotriosidase activity can be effectively reduced, 
however, upon initiation of enzyme supplementation therapy. 

Unlike other members of the family of proteins homologous to chitinase, 
chitotriosidase does have chitinolytic activity. Renkema et al, supra. Like the 
bacterial enzyme, it has ability to degrade chitin azure, a polymer of betal-4-linked 
N-acetylglucosamine moieties. Chitotriosidase, along with HC gp-39, is believed 
to be involved in tissue remodeling in the mammalian cell, and thus may serve as 
useful tools in the development of therapeutics and diagnostics for tissue 
remodeling disorders. For example, the enzyme can serve as a valuable diagnostic 
marker for Gaucher disease, and may be useful in monitoring the efficacy of 
therapy. 

Chitin is a major structural component of the cell walls of fungal 
pathogOTs. Chitinases are enzymes found in a variety of species, including plants, 
bacteria, fiingi, insects, and fish (Flach e/a/., Experientia ^5:701-716 (1992)), that 
hydrolyze chitin or chitin-like substrates, such as 4-methylumbeUifetyI 
chitotrioside. It is well documented that plant chitinases are potent inhibitors of 
fangal growth. Broekaert era/., Physiol Mol Plant Pathol 35:319-331 (1988); 
Roberts et al., J. Gen Microbiol 134: 169-174 (1988). Thus, plant chitinases may 
protect plants against fungi and other pathogens. It has often been suggested that 
humans and other mammals do not possess an analogous chitinase. Raghavan et 
al. Infect, Immun, 62:1901-1908 (1994); Huber etal, Proc. Natl Acad Sci, 
USA ^5:2807.2810 (1991). This may not be the case. 

Hollak et al, discussed supra, has now observed that chitotriosidase is 
secreted in large quantities by activated human macrophages and has been 
reported to be markedly elevated in the plasma of patients suffering from Gaucher 
disease. Boot et al, J, Biol. Chem. 27(?:26252-26256 (1995), has recently 
reported on the cloning of chitotriosidase. This report also suggests that this 
enzyme may play a role in the degradation of chitin-containing pathogens. Escott 



and Adams, Infect Immun. 63:A710'4V3 (1995), report on the detection of a 
chitinase in human serum and leukocytes that cleaves colloidal [^H] chitin. They 
report that the chitinase activity detected in human granulocytes may reflect that 
fact that the enzyme in humans may play a protective role against fungi and other 
chitin-containing microbial pathogens. This role is conceivable in view of the 
well-documented role of plant chitinases as potent inhibitors of fungal growth. 

Accordingly, there is a need in the art for the identification of novel human 
chitinase enzymes, which along with chitotriosidase and HC gp-39, can serve as 
useful tools in the development of therapeutics and diagnostics for tissue 
remodeling disorders or as anti-fungal agents. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding a human chitinase alpha polypeptide having the amino 
acid sequence in Figure 1 [SEQ ID N0:2} or Figure 2 [SEQ ID N0:4] or the 
amino acid sequence encoded by the cDNA clone deposited in a bacterial host as 
ATCC Deposit Number 97529 on May 3, 1996. The present invention further 
provides isolated nucleic acid molecules comprising a polynucleotide encoding a 
human chitinase alpha-2 polypeptide having the amino acid sequence in Figure 5 
[SEQ ID NO: 12]. 

The nucleotide sequence determined by sequencing the deposited chitinase 
alpha cDNA done, which is shown in Figure 1 [SEQ ID NO: 1], contains an open 
reading frame encoding a polypeptide of about 421 amino acid residues including 
an initiation codon at nucleotide positions 82-84, a leader sequence of about 26 
amino acid residues and a deduced molecular weight of about 44 IcDa. The 395 
amino acids of the predicted mature chitinase alpha protein is shown in Figure 1 
(last 395 residues) and in SEQ ID N0:2 (from amino acid residue 27 to residue 
421). 

The nucleotide sequence shown in Figure 5 [SEQ ID NO: 1 1 ] contains an 
open reading frame encoding a polypeptide of about 385 amino acid residues 
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including an initiation codon at nucleotide positions 27-29 and a leader sequence 
of about 20 amino acid residues. The 365 amino acids of the predicted mature 
chitinase alpha-2 protein is shown in Figure 5 (last 365 residues) and in SEQ JD 
NO: 12 (from amino acid residue 21 to residue 385). As discussed in more detail 
below, the chitinase alphd-2 gene is a shorter variant of the chitinase alpha gene. 

Thus, one aspect of the invention provides isolated nucleic acid molecules 
compriang a polynucleotide having a nucleotide sequence selected from the group 
consisting of (a) a nucleotide sequence encoding the chitinase alpha polypeptide 
having the complete amino acid sequence shown in Figure 1 [SEQ ID NO:2] or 
Figure 2 [SEQ ID N0:4]; (b) a nucleotide sequence encoding the mature 
chitinase alpha polypeptide having the amino acid sequence at positions 27-421 
in Figure 1 [SEQ ID N0:2] or Figure 2 [SEQ ID N0:4]; (c) a nucleotide 
sequence encoding the chitinase alpha polypeptide having the complete amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 97529; (d) 
a nucleotide sequence encoding the mature chitinase alpha polypeptide having the 
amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
97529; (e) a nucleotide sequence encoding the chitinase alpha-2 polypeptide 
having the complete amino acid sequence shown in Figure 5 [SEQ ID NO: 12]; 
(f) a nucleotide sequence encoding the mature chitinase aipha-2 polypeptide 
having the amino acid sequence at positions 2 1-385 in Figure 5 [SEQ ID NO: 12]; 
and (g) a nucleotide sequence complementary to any of the nucleotide sequences 
in (a), (b), (c), (d), (e) or (f) above. Preferably, the nucleic acid molecule will 
encode the mature polypeptide as shown in Figure I [SEQ ID N0:2], Figure 2 
[SEQ ID N0:4], Figure 5 [SEQ ID NO: 12] or as encoded by the above-described 
deposited cDNA. 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to 
any of the nucleotide sequences in (a), (b)» (c), (d), (e), (f), or (g) above, or a 
polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide having a nucleotide sequence identical to a nucleotide sequence in 



(a), (b), (c), (d), (e), (f) or (g) above. The polynucleotide which hybrizes does not 
hybridize under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence consisting of only A residues or of only T residues. An 
additonal nucleic acid embodiment of the invention relates to an isolated nucleic 
acid molecule comprising a polynucleotide which encodes the amino acid 
sequence of an epitope-bearing portion of an chitinase alpha polypeptide having 
an amino acid sequence in (a), (b), (c), (d). (e), or ({) above. 

The present invention also relates to recombinant vectors which include 
the isolated nucleic acid molecules of the present invention and host ceUs 
containing the recombinant vectors, as well as methods of making such vectors 
and host cells and for using them for production of chitinase alpha- and chitinase 
alpha-2- polypeptides or peptides by recombinant techniques. 

The invention further provides an isolated chitinase alpha or chitinase 
alpha-2 polypeptide having amino acid sequence selected from the group 
consisting of: (a) the amino acid sequence of the chitinase alpha polypeptide 
having the complete 421 amino acid sequence including the leader sequence 
shown in Figure 1 [SEQ ID N0:2] or Figure 2 [SEQ ID N0:4]; (b) the amino 
acid sequence of the mature chitinase alpha polypeptide (without the leader) 
having the amino acid sequence at positions 27^21 in Figure 1 [SEQ ID N0:2] 
or Figure 2 [SEQ ID N0:4]; (c) the amino acid sequence of the chitinase alpha 
polypeptide having the complete amino acid sequence including the leader 
encoded by the cDNA clone contained in ATCC Deposit No. 97529; (d) the 
amino acid sequence of the mature chitinase alpha polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
97529; (e) the amino acid sequence of the chitinase alpha-2 polypeptide having 
the complete 385 amino acid sequence including the leader sequence shown in 
Figure 5 [SEQ ID NO: 12]; and (f) the amino acid sequence of the mature chitinase 
alpha-2 polypeptide (without the leader) having the amino acid sequence at 
positions 21-385 in Figure 5 [SEQ ID N0:12]. 

The polypeptides of the present invention also include polypeptides having 
an amino acid sequence with at least 90% similarity, more preferably at least 95% 
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similarity to those described in (a), (b), (c), (d), (e), or (0 above, as well as 
polypeptides having an amino acid sequence at least 80% identical, more 
preferably at least 90% identical, and still more preferably 95%, 97%, 98% or 
99% identical to those above. 
5 An additional embodiment of this aspect of the invention relates to a 

peptide or polypeptide which has the amino acid sequence of an epitope-bearing 
portion of a chitinase alpha polypeptide or a chitinase alpha-2 polypeptide having 
an amino acid sequence described in (a), (b), (c), (d), (e), or (f) above. Peptides 
or polypeptides having the amino acid sequence of an epitope-bearing portion of 

10 a chitinase alpha polypeptide or a chitinase aIpha-2 polypeptide of the invention 

include portions of such polypeptides with at least six or seven, preferably at least 
nine, and more preferably at least about 30 amino acids to about 50 amino acids, 
although epitope*bearing polypeptides of any length up to and including the entire 
amino acid sequence of a polypeptide of the invention described above also are 

15 included in the invention. In another embodiment the invention provides an 

isolated antibody that binds specifically to a chitinase alpha polypeptide or 
chitinase aIpha-2 polypeptide having an amino acid sequence described in (a), (b), 
(c), (d), (e), or (f) above. 

Chitinase alpha and chitinase alpha-2, which are secreted, and which have 

20 significant homology to human cartilage glycoprotein-39 (HC gp-39) and to 

chitotriosidase, are believed by the present inventors to be involved in tissue 
remodeling. Therefore, chitinase alpha and chitinase alpha-2 are useful targets for 
the inhibition of tissue remodeling disease processes, such as rheumatoid arthritis. 
Accordingly, the invention is related to a method of treating an individual in need 

25 of a decreased level of chitinase alpha or chitinase alpha-2 activity in the body, 

which involves administering to such an individual a composition comprising an 
antagonist to chitinase alpha and/or chitinase alpha-2 such as anti-chitinase alpha- 
or anti-chitinase aIpha-2-antibodies, 

It is also believed by the inventors that the chitinase alpha polypeptide and 

30 the chitinase alpha-2 polypeptide of the invention can be used to treat fungal 

infection, as chitin is a major component of the fungal cell wall, and chitnases 



cleave this polymer. Thus, an additional aspect of the invention is related to a 
method for treating flingal infection, which involves administering to such an 
indivdual a composition comprising a chitinase alpha polypeptide or a chitinase 
alpha-2 polypeptide of the invention. 

Further, for a number of disorders, it is believed by the inventors that 
significantly higher or lower levels of chitinase alpha and/or chitinase alpha-2 gene 
expression can be detected in bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to 
a "standard" chitinase alpha or chitinase aIpha-2 gene expression level, i.e., the 
chitinase alpha or chitinase aipha-2 gene expression level in bodily fluids from an 
individual not having the disorder. Thus, the invention provides a diagnostic 
method useful during diagnosis of a disorder related to an abnormal level of 
chitinase alpha or chitinase alpha-2 gene expression, which involves (a) assaying 
chitinase alpha or chitinase alpha-2 gene expression level in cells or body fluid of 
that individual; (b) comparing that chitinase alpha or chitinase alpha-2 gene 
expression level with a standard chitinase alpha or chitinase alpha-2 gene 
expression level, whereby an increase or decrease in the assayed chitinase alpha 
or chitinase alpha-2 gene expression level compared to the standard expression 
level is indicative of a disorder. 

Brief Description of the Figures 

Figure 1 provides the coding sequence (SEQ ID NO: I) and deduced 
amino acid sequence (SEQ ID N0 .2) of the polypeptide chitinase alpha. 

Figure 2 provides the coding sequence (SEQ ID N0:3) and deduced 
amino acid sequence (SEQ ID N0:4) of a chitinase alpha variant. The nucleotide 
sequence is modified from Figure 1 at positions 607-609. 

Figure 3 shows regions of similarity between the amino acid sequences of 
chitinase alpha (HPMFW51) (SEQ ID N0:4), HC gp-39 (SEQ ID N0:5), and 
chitotriosidase (SEQ ID NO:6). 
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Figure 4 shows an analysis of the chitinase alpha amino acid sequence. 
In particular, the Kyte-Dooiittle hydrophilicity plot and the Jameson-Wolf 
antigenic inex are provided. Amino acid residues Leu38-AIa51; Asn66-Phe85; 
Asnll7-Lysl27; Metl45-Leul53; Asnl65-Aspl72; Tyrl77.Hisl85; Phe245- 
5 Ile254; Gly256-Tyr274; Lys286.Tyr293; Gly366-Lys376; and Asp394-Pro407 

in Figure I are highly antigenic regions of the chitinase alpha protein. 

Figure 5 provides the coding sequence (SEQ ID N0:1 1) and deduced 
amino acid sequence (SEQ ID NO: 12) of a shortened chitinase alpha variant - 
chitinase alpha-2. 

1 0 Figures 6(a-d) provide a comparison of the nucleotide (a-c) and deduced 

amino acid (d) sequences of chitinase alpha-2 and chitinase alpha. 



Detailed Description of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding the chitinase alpha protein having the amino acid 

15 sequence shown in Figure 1 [SEQ ID N0:2] which was determined by 

sequencing a cloned cDNA. The nucleotide sequence shown in Figure I [SEQ ID 
N0:1] was obtained by sequencing the hpmfwSl cDNA clone encoding a 
chitinase alpha polypeptide, which was deposited on May 3 , 1996 at the 
American Type Culture Collection, 12301 Park Lawn Drive, Rockville, Maryland 

20 20852, and given accession number 97529. The deposited clone is contained in 

the pBluescript SK(-) plasmid (Stratagene, LaJolla, CA). 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the chitinase alpha-2 protein having the 
amino acid sequence shown in Figure 5 [SEQ ID NO: 12]. 



RECTIFIED SHEET (RULE 91) 



11 

Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using an automated DNA 
sequence (such as the Model 373 from Applied Biosystems, Inc.), and alt amino 
acid sequences of polypeptides encoded by DNA molecules determined herein 
were predicted by translation of a DNA sequence determined as above. 
Therefore, as is known in the art for any DNA sequence determined by this 
automated approach, any nucleotide sequence determined herein may contain a 
some errors. Nucleotide sequences determined by automation are typically at least 
about 90% identical, more typically at least about 95% to at least about 99.9% 
identical to the actual nucleotide sequence of the sequenced DNA molecule. The 
actual sequence can be more precisely determined by other approaches including 
manual DNA sequencing methods well known in the art. As is also known in the 
art, a single insertion or deletion in a determined nucleotide sequence compared 
to the actual sequence will cause a frame shift in translation of the nucleotide 
sequence such that the predicted amino acid sequence encoded by a determined 
nucleotide sequence will be completely different from the amino acid sequence 
actually encoded by the sequwced DNA molecule, beginning at the point of such 
an insertion or deletion. 

Unless othenvise indicated, each "nucleotide sequence" set forth herein is 
presented as a sequence of deoxyribonucleotides (abbreviated A, G , C and T). 
However, by "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C and U) where each thymidine 
deoxynucleotide (T) in the specified deoxynucleotide sequence in is replaced by 
the ribonucleotide uridine (U). For instance, reference to an RNA molecule 
having the sequence of SEQ ID NO: 1 set forth using deoxyribonucleotide 
abbreviations is intended to indicate an RNA molecule having a sequence in 
which each deoxynucleotide A, G or C of SEQ ID NO: 1 has been replaced by the 
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corresponding ribonucleotide A, G or C, and each deoxynucleotide T has been 
replaced by a ribonucleotide U. 

Using the infonnation provided herein, such as the nucleotide sequence in 
Figure 1 [SEQ ID NO: I], Figure 2 [SEQ ID N0:3], or Figure 5 [SEQ ID NO: 1 1], 
5 a nucleic acid molecule of the present invention encoding a chitinase alpha or 

chitinase aipha-2 polypeptide may be obtained using standard cloning and 
screening procedures, such as those for cloning cDNAs using mRNA as starting 
material. Illustrative of the invention, the nucleic acid molecule described in 
Figure 1 [SEQ ID NO: 1 ] was discovered in a cDNA library derived from human 

1 0 placenta tissue. The determined nucleotide sequence of the chitinase alpha cDN A 

of Figure I contains an open reading frame encoding a protein of about 421 amino 
acid residues with an initiation codon at positions 82-84 of the nucleotide 
sequmce shown in Figure 1 [SEQ ID NO, 1], and a predicted leader sequence of 
about 26 amino acid residues, and a deduced molecular weight of about 44 kDa 

1 5 without the leader and about 46.6 IcDa with the leader. The amino acid sequence 

of the predicted mature chitinase alpha protein is also shown in Figure 1 [SEQ ID 
N0:2] from about amino acid residue 27 to about residue 421. The chitinase 
alpha protein shown in Figure 1 [SEQ ID N0:2] is about 52% identical and about 
68% similar to human gp-39, and about 52% identical and about 70% similar to 

20 chitotriosidase (Figure 3 ) , 

The determined nucleotide sequence of the chitinase alpha-2 cDNA of 
Figure 5 contains an open reading frame encoding a protein of about 385 amino 
acid residues with an initiation codon at positions 27-29 of the nucleotide 
sequence shown in Figure 5 [SEQ ID NO: 1 1], and a predicted leader sequence 

25 of about 20 amino acid residues. The amino acid sequence of the predicted 

mature chitinase alpha-2 protein is also shown in Figure 5 [SEQ ED NO: 12] from 
about amino acid residue 21 to about residue 385. The chitinase alpha-2 
nucleotide sequence shown in Figure 5 [SEQ ID N0:1 1] is about 99% identical 
and about 99 % similar to the chitinase alpha nucleotide sequence shown in Figure 

30 1 [SEQIDNO:!]. 
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As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors discussed above, as well as the variability of cleavage sites for 
leaders in different known proteins, the actual chitinase alpha polypeptide encoded 
by the deposited cDNA comprises about 421 amino acids» but may be anywhere 
5 in the range of 4 10-432 amino acids; and the actual leader sequence of this protein 

is about 26 amino acids, but may be anywhere in the range of about 21 to about 
3 1 amino acids. Similarly, the actual chitinase alpha-2 polypeptide comprises 
about 385 amino adds, but may be anywhere in the range of 375-395 amino acids; 
and the actual leader sequence of this protein is about 20 amino acids, but may be 

10 anywhere in the range of about 16 to about 25 amino acids. 

As indicated, nucleic acid molecules of the present invention may be in the 
form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 

15 may be the coding strand, also known as the sense strand, or it may be the non- 

coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid moIecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment For example, 
recombinant DNA molecules contained in a vector are considered isolated for the 

20 purposes of the present invention. Further examples of isolated DNA molecules 

include recombinant DNA molecules maintained in heterologous host ceUs or 
purified (partially or substantially) DNA molecules in solution. Isolated RNA 
molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the 
present invention. Isolated nucleic acid molecules according to the present 

25 invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) with an initiation codon at 
positions 82-84 of the nucleotide sequence shown in Figure 1 [SEQ ID N0:1] or 
Figure 2 [SEQ ID N0:3]; DNA molecules comprising the coding sequence for the 

30 mature chitinase alpha protein shown in Figure 1 (last 395 amino acids) [SEQ ED 

N0:2] or Figure 2 [SEQ ID N0:4] (last 395 amino acids); DNA molecules 
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comprising an open reading frame (ORF) with an initiation codon at positions 27- 
29 of the nucleotide sequence shown in Figure 5 [SEQ ED NO: II]; DNA 
molecules comprising the coding sequence for the mature chitinase alpha-2 protein 
shown in Figure 5 (last 365 amino acids) [SEQ ID NO: 12]; and DNA molecules 
5 vAich comprise a sequence substantially different from those described above but 

which, due to the degeneracy of the genetic code, still encode the chitinase alpha 
protein. Of course, the genetic code is well known in the art. Thus, it would be 
routine for one skilled in the art to generate the degenerate variants described 
above. 

'0 In another aspect, the invention provides isolated nucleic acid molecules 

encoding the chitinase alpha polypeptide having an amino acid sequence encoded 
by the cDNA clone contained in the plasmid deposited as ATCC Deposit No 
97529 on May 3, 1996. Preferably, this nucleic acid molecule will encode the 
mature polypeptide encoded by the above-described deposited cDNA clone. The 

1 5 invention further provides an isolated nucleic acid molecule having the nucleotide 

sequence shown in Figure 1 [SEQ ID N0:1] or Figure 2 [SEQ ID N0:3] or 
Figure 5 [SEQ ID NO: 11 ] or the nucleotide sequence of the chitinase alpha cDNA 
contained in the above-described deposited clone, or nucleic acid molecule having 
a sequence complementary to one of the above sequences. Such isolated 

20 molecules, particulariy DNA molecules, are useful as probes for gene mapping by 

in situ hybridization with chromosomes and for detecting expression of the 
chitinase alpha gene in human tissue, for instance, by Northern blot analysis. 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic acid 

25 molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 

sequence shown in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:3), or Figure 
5 (SEQ ID NO: 11), is intended fragments at least about 15 nt, and more 
preferably at least about 20 nt, still more preferably at least about 30 nt, and even 
more preferably, at least about 40 nt in length which are useful as diagnostic 

30 probes and primers as discussed herein. Of course, larger fragments 50-1400 nt 

in length are also useful according to the present invention as are fragments 
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corresponding to most, if not all, of the nucleotide sequence of the deposited 
cDNA or as shown in Figure 1 (SEQ ID N0:1), Figure 2 (SEQ ED NO:3), or 
Figure 5 (SEQ ID N0:1 1). By a fragment at least 20 nt in length, for example, 
is intended fragments which include 20 or more contiguous bases from the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence as shown 
in Figure 1 (SEQ ID N0:1), Figure 2 (SEQ ID N0:3X or Figure 5 (SEQ ID 
NO:l 1). Since the gene has been deposited and the nucleotide sequences shown 
in Figure 1 (SEQ ID N0:1), Figure 2 (SEQ ID N0:3) and Figure 5 (SEQ ID 
NO: 1 1) are provided, generating such DNA fragments would be routine to the 
skilled artisan. For example, restriction endonuclease cleavage or shearing by 
sonication could easily be used to generate fragments of various sizes. 
Alternatively, such fragments could be generated synthetically. 

Preferred nucleic acid fragments of the present invention include nucleic 
acid molecules encoding epitope-bearing portions of the chitinase alpha and 
chitinase aIpha-2 proteins. In particular, such nucleic acid fragments of the 
present invention include nucleic acid molecules encoding: a polypeptide 
comprising amino acid residues from about 38 to about 51 in Figure 1 (SEQ ED 
N0:2); a polypeptide comprising amino acid residues from about 66 to about 85 
in Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues from 
about 1 17 to about 127 in Figure 1 (SEQ ID N0:2); a polypeptide comprising 
amino acid residues from about 145 to about 153 in Figure 1 (SEQ ID N0:2); a 
polypeptide comprising amino acid residues from about 165 to about 172 in 
Figure 1 (SEQ ID N0:2); a polypeptide comprising anuno acid residues from 
about 177 to about 185 in Figure 1 (SEQ ID N0:2); a polypeptide comprising 
amino acid residues from about 245 to about 254 in Figure 1 (SEQ ID N0:2); a 
polypeptide comprising amino acid residues from about 256 to about 274 in 
Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues from 
about 286 to about 293 in Figure 1 (SEQ ID N0:2); a polypeptide comprising 
amino acid residues from about 366 to about 376 in Figure 1 (SEQ ID N0:2); and 
a polypeptide comprising amino acid residues from about 394 to about 407 in 
Figure 1 (SEQ ID N0:2). 
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The inventors have determined that the above polypeptide fragments are 
antigenic regions of the chitinase alpha protein. Methods for determining other 
such epitope-bearing portions of the chitinase alpha protein are described in detail 
below. 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the cDNA clone contained in ATCC 
Deposit 97529. By "stringent hybridization conditions" is intended overnight 
incubation at 42*C in a solution comprising: 50% formamide, 5x SSC (150 mM 
NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 ^g/ml denatured, sheared salmon sperm 
DNA, followed by washing the filters in O.lx SSC at about 65X. By a 
polynucleotide which hybridizes to a "portion" of a polynucleotide is intended a 
polynucleotide (either DNA or RN A) hybridizing to at least about 1 5 nucleotides 
(nt), and more preferably at least about 20 nt, still more preferably at least about 
30 nt, and even more preferably about 30-70 nt of the reference polynucleotide. 
These are usefiji as diagnostic probes and primers as discussed above and in more 
detail below. 

Of course, polynucleotides hybridizing to a larger portion of the reference 
polynucleotide (e.g., the deposited cDNA clone), for instance, a portion 50-750 
nt in length, or even to the entire length of the reference polynucleotide, also 
useful as probes according to the present invention, as are polynucleotides 
corresponding to most, if not all, of the nucleotide sequence of the deposited 
cDNA or the nucleotide sequence as shown in Figure I [SEQ ID NO: 1], Figure 
2 [SEQ ID N0:3] or Figure 5 [SEQ ID NO: 11]. By a portion of a polynucleotide 
of "at least 20 nt in length," for example, is intended 20 or more contiguous 
nucleotides from the nucleotide sequence of the reference polynucleotide, (e.g., 
the deposited cDNA or the nucleotide sequence as shown in Figure 1 [SEQ ID 
NO: 1], Figure 2.[SEQ ID N0:3] or Figure 5 [SEQ ID NO: 11]). As indicated, 
such portions are useful diagnostically either as a probe according to conventional 
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DNA hybridization techniques or as primers for ampiificatton of a target sequence 
by the polymerase chain reaction (PCR), as desribed, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J. et aL, eds., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989), the 
entire disclosure of which is hereby incorporated herein by reference. 

Since a chitinase alpha cDNA clone has been deposited and its detemnined 
nucleotide sequence is provided in Figure I [SEQ ID N0:1] and the nucleotide 
sequence of a variant is provided in Figure 2 [SEQ ID N0:3], generating 
polynucleotides which hybridize to a portion of the chitinase alpha cDNA 
molecule would be routine to the skilled artisan. Similarly, since the determined 
nucleotide sequ^ce for the chitinase alpha-2 protein is provided in Figure 5 [SEQ 
ID NO: 11], generating polynucleotides which h>i)ridize to a portion of the 
chitinase alpha-2 cDNA molecule would be routine to the skilled artisan. For 
example, restriction endonuclease cleavage or shearing by sonication of the 
chitinase alpha cDNA clone or chitinase aIpha-2 cDNA clone could easily be used 
to generate DNA portions of various sizes which are polynucleotides that 
hybridize to a portion of the chitinase alpha or chitinase alpha-2 cDNA molecule. 
Alternatively, the hybridizing polynucleotides of the present invention could be 
generated synthetically according to known techniques. Of course, a 
polynucleotide which hybridizes only to a poly A sequence (such as the 3 ' terminal 
poIy(A) tract of the chitinase alpha cDNA shown in Figure 1 [SEQ ID NO: 1]) or 
Figure 2 [SEQ ID NO: 3], or to a complementary stretch of T (or U) resides, 
would not be included in a polynucleotide of the invention used to hybridize to a 
portion of a nucleic acid of the invention, since such a polynucleotide would 
hybridize to any nucleic acid molecule contain a poly (A) stretch or the 
complement thereof (e.g., practiajly any double-standed cDNA clone). 

As indicated, nucleic acid molecules of the present invention which encode 
the chitinase alpha or chitinase alpha-2 polypeptide may include, but are not 
limited to those encoding the amino acid sequence of the mature polypeptide, by 
itself; the coding sequence for the mature polypeptide and additional sequences, 
such as those encoding the about 26 amino acid leader or secretory sequence, such 
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as a pre-, or pro- or prepro- protein sequence; the coding sequence of the mature 
polypeptide, with or without the aforementioned additional coding sequences, 
together with additional, non*coding sequences, including for example, but not 
limited to introns and non-coding 5' and 3' sequences, such as the transcribed, 
5 non-translated sequences that play a role in transcription, mRNA processing - 

including splicing and polyadenylation signals, for example - ribosome binding and 
stability of mRNA; an additional coding sequence which codes for additional 
amino acids, such as those which provide additional functionalities Thus, the 
sequence encoding the polypeptide may be fused to a marker sequence, such as 

10 a sequence encoding a peptide which facilitates purification of the fused 

polypeptide. In certain preferred embodiments of this aspect of the invention, the 
maricer amino acid sequence is a hexa-histidine peptide, such as the tag provided 
in a pQE vector (Qiagen, Inc.), among others, many of which are commercially 
available. As described in Gentz et al, Proc, Nail Acad, Sci. (USA) <S(5:82 1-824 

15 (1989), for instance, hexa-histidine provides for convenient purification of the 

fusion protein. The "HA" tag is another peptide usefiji for purification which 
corresponds to an epitope derived from the influenza hemagglutinin protein, which 
has been described by Wilson ei al. Cell 37161 (1984). 

The present invention fiirther relates to variants of the nucleic acid 

20 molecules of the present invention, which encode portions, analogs or derivatives 

of the chitinase alpha or chitinase alpha-2 protein. Variants may occur naturally, 
such as a natural allelic variant. By an "allelic variant" is intended one of several 
alternate forms of a gene occupying a given locus on a chromosome of an 
orgnanism. Genes 11^ Lewin, ed. Non-naturally occurring variants may be 

25 produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions. The substitutions, deletions or additions may involve one or more 
nucleotides. The variants may be altered in coding or non-coding regions or both. 
Alterations in the coding regions may produce conservative or non-conservative 

30 amino add substitutions, deletions or additions. Especially preferred among these 

are silent substitutions, additions and deletions, wluch do not alter the properties 
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and activities of the chitinase alpha or chitinase alpha-2 protein or portions 
thereof. Also especially preferred in this regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 90% 
5 identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical to 

(a) a nucleotide sequence encoding the full-length chitinase alpha polypeptide 
having the complete amino acid sequence in Figure 1 [SEQ ID N0:2] or Figure 
2 [SEQ ID N0:4] including the predicted leader sequence; (b) a nucleotide 
sequence encoding the mature chitinase alpha polypeptide (full-length polypeptide 

10 with the leader removed) having the amino acid sequence at positions 27-421 in 

Figure 1 [SEQ ID N0:2] or Figure 2 [SEQ ID N0:4]; a nucleotide sequence 
encoding the full-length chitinase alpha polypeptide having the complete amino 
acid sequence including the leader encoded by the cDNA clone contained in 
ATCC Deposit No. 97529; (d) a nucleotide sequence encoding the mature 

1 5 chitinase alpha polypeptide having the amino acid sequence encoded by the cDN A 

clone contained in ATCC Deposit No. 97529; (e) a nucleotide sequence encoding 
the chitinase alpha-2 polypeptide having the complete amino acid sequence shown 
in Figure 5 [SEQ ED NO: 12]; (f) a nucleotide sequence encoding the mature 
chitinase aIpha-2 polypeptide having the amino acid sequence at positions 21-385 

20 in Figure 5 [SEQ ID NO: 12]; and (g) a nucleotide sequence complementary to any 

of the nucleotide sequences in (a), (b), (c), (d), (e), or (f). 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding an chitinase alpha or 
chitinase alpha-2 polypeptide is intended that the nucleotide sequence of the 

25 polynucleotide is identical to the reference sequence except that the polynucleotide 

sequence may include up to five point mutations per each 100 nucleotides of the 
reference nucleotide sequence encoding the chitinase alpha or chitinase alpha-2 
polypeptide. In other words, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 

30 the nucleotides in the reference sequence may be deleted or substituted with 

another nucleotide, or a number of nucleotides up to 5% of the total nucleotides 
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in the reference sequence may be inserted into the reference sequence. These 
mutations of the reference sequence may occur at the 5' or 3' terminal positions 
of the reference nucleotide sequence or anywhere between those tenninal 
positions, interspersed either individually among nucleotides in the reference 
5 sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 90%, 95%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in Figure 1, Figure 2, or FigureS or to the nucleotides sequence 
of the deposited cDNA clone can be determined conventionally using known 

10 computer programs such as the Bestfit program (Wisconsin Sequence Analysis 

Package, Version 8 for Unix, Genetics Computer Group, University Research 
Park, 575 Science Drive, Madison, WI 5371 1). Bestfit uses the local homology 
algorithm of Smith and Waterman {Advances in Applied Mathematics 2:482-489 
(1981)) to find the best segment of homology between two sequences. When 

15 using Bestfit or any other sequence alignment program to determine whether a 

particular sequence is, for instance, 95% identical to a reference sequence 
according to the present invention, the parameters are set, of course, such that the 
percentage of identity is calculated over the full length of the reference nucleotide 
sequence and that gaps in homology of up to 5% of the total number of 

20 nucleotides in the reference sequence are allowed. 

The present application is directed to nucleic acid molecules at least 90%, 
95%, 97%, 98% or 99% identical to the nucleic acid sequence shown in Figure X 
[SEQ ID NO: 1] or Figure 2 [SEQ ID N0:3] or Figure 5 [SEQ ID NO: 1 1] or to 
the nucleic acid sequence of the deposited cDNA, irrespective of whether they 

25 encode a polypeptide having chitinase alpha or chitinase alpha-2 activity. This is 

because, even where a particular nucleic acid molecule does not encode a 
polypeptide having such activity, one of skill in the art would still know how to 
use the nucleic acid molecule, for instance, as a hybridization probe or a 
polymerase chain reaction (PCR) primer. Uses of the nucleic acid molecules of 

30 the present invention that do not encode a polypeptide having chitinase alpha or 

chitinase alpha-2 activity include, inter alia, (1) isolating the chitinase alpha or 
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chitinase aipha-2 gene or allelic variants thereof in a cDNA library; (2) in situ 
hybridtiation (e.g., "FISH") to metaphase chromosomal spreads to provide 
precise chromosomal location of the chitinase alpha or chitinase alpha-2 gene as 
described in Verma ei aL, Human Chromosomes: a Mawial of Basic Techniques, 
Pergamon Press, New York (1988); and Northern Blot analysis for detecting 
chitinase alpha or chitinase alpha-2 mRNA expression in specific cell types (e.g., 
placenta). 

Preferred, however, are nucleic acid molecules having sequences at least 
90%, 95%, 96%, 97%. 98% or 99% identical to the nucleic acid sequence shown 
in Figure 1 [SEQ ID N0:1} or to the nucleic acid sequence of the deposited 
cDNA which do, in fact, encode a polypeptide having chitinase alpha or chitinase 
alpha-2 protein activity. By "a polypeptide having chitinase alpha or chitinase 
alpha-2 activity" is intended polypeptides exhibiting activity similar, but not 
necessarily identical, to an activity of the chitinase alpha or chitinase alpha-2 
protein of the invention (either the full-length protein or, preferably, the mature 
protein) as measured in a particular biological assay. One such assay for chitinase 
activity is the assay described by Dickinson et al, J, Gen. hdicrobioL 755:1417- 
1421 (1989). 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
in the art will immediately recognize that a large number of the nucleic acid 
molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% 
identical to the nucleic acid sequence of the deposited cDNA or the nucleic acid 
sequence shown in Figure 1 [SEQ ID N0:1] or Figure 2 [SEQ ID N0:3] or 
Figure 5 [SEQ ID NO: 1 1] will encode a polypeptide "having chitinase alpha or 
chitinase alpha-2 protein activity " In fact, since degenerate variants of these 
nucleotide sequences all encode the same polypeptide, this will be clear to the 
skilled artisan even without performing one of the above-described comparison 
assays. It will be further recognized in the art that, for such nucleic acid molecules 
that are not degenerate variants, a reasonable number will also encode a 
polypeptide haying chitinase alpha or chitinase alpha-2 protein activity. This is 
because the skilled artisan is fully aware of amino acid substitutions that are either 
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less likely or not likely to significantly effect protein function (e.g., replacing one 
aliphatic aniino acid with a second aliphatic amino acid). 

For example, guidance concerning how to make phenotypically silent 
amino add subsdtions is provided in Bowie, J.U. et al, "Deciphering the Message 
in Protein Sequences: Tolerance to Amino Acid Substitutions " Science 247: 1306- 
1310 (1990), wherdn the authors indicate that there are two main approaches for 
studying the tolerance of an amino acid sequence to change. The first method 
relies on the process of evolution, in which mutations are either accepted or 
rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections 
or so^e^ to identify sequences that maintain functionality. As the authors state, 
these studies have revealed that proteins are surprisingly tolerant of amino acid 
substitutions. The authors further indicate which amino acid changes are likely to 
be permissive at a certain position of the protein. For example, most buried amino 
acid residues require nonpolar side chains, whereas few features of surface side 
chains are generally conser/ed. Other such phenotypically silent substitutions are 
described in Bowie, J.U., etai, supra, and the references cited therein. 

Thus, in accordance with another aspect of the invention, the following 
nucleotide sequence variations of the chitinase alpha gene in Figure 1 [SEQ ID 
N0:1] are provided: the nucleotides at poshions 607-609 in Figure 1 [SEQ ID 
N0:1] are changed to either GAA or GAG. This results in an amino acid 
substitution of glutamic acid for isoleucine. Figure 2 [SEQ ID N0:3] depicts this 
sequence variant, which is expected to encode a peptide hawng chitinolytic 
activity. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of chitinase alpha 
polypeptides or portions thereof by recombinant techniques. 
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Recombinant constructs may be introduced into host cells using well 
known techniques such as infection, transduction, transfection, transvection, 
electroporation and transformation. The vector may be, for example, a phage, 
plasmid, viral or retroviral vector. Retroviral vectors may be replication 
competent or replication defective. In the latter case, viral propagation generally 
will occur only in complementing host cells. 

The polynucleotides may be joined to a vector contsdning a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising cis-acting control regions to the 
polynucleotide of interest. Appropriate trans-acting factors may be supplied by 
the host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for 
specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 
episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such is 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations 
thereof, such as cosnuds and phagemids. 

The DNA insert should be pperatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E, coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
constructs will fiirther contain sites for transcription initiation,termination and, in 
the transcribed region, a ribosome binding site for translation. The coding portion 
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of the mature transcripts expressed by the constnicts will include a translation 
initiating AUG at the beg^ining and a termination codon appropriately positioned 
at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 

5 selectable marker. Such markers include dihydrofolate reductase or neomycin 

resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes 
for cukuring in£ coli and other bacteria. Representative examples of appropriate 
hosts include bacterial cells, such as £. coli, Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila 

10 S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes 

melanoma cells; and plant cells. Appropriate culture media and conditions for the 
above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 

15 vectors, pNHSA, pNH16a, pNHlSA, pNH46A, available from Stratagene; and 

ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. 
Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
and pSG avwlable from Stratagene; pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia, and pA2 available from Qiagen. Other suitable vectors will be 

20 readily apparent to the skilled artisan. 

Among knowTi bacterial promoters suitable for use in the present invention 
include the £. coli lad and lacZ promoters, the T3 and T7 promoters, the gpi 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable 
eukaryotic promoters include the CMV immediate early promoter, the HSV 

25 thymidine kinase promoter, the early and late SV40 promoters, the promoters of 

retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and 
metallothionein promoters, such as the mouse metallothionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid- 

30 mediated transfection, electroporation, transduction, infection or other methods. 
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Such methods are described in many standard laboratory manuals, such as Davis 
et al , Basic Methods in Molecular Biology ( 1 986). 

Transcription of the DNA encoding the polypeptides of the present 
invention by higher eukaryotes may be increased by inserting an enhancer 
sequence into the vector. Enhancers are cis-acting elements of DNA, usually 
about from 10 to 300 bp that act to increase transcriptional activity of a promoter 
in a given host cell-type. Examples of enhancers include the SV40 enhancer, 
which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovinis early promoter enhancer, the polyoma enhancer on the late side 
of the replication origin, and adenovirus enhancers. 

For secretion of the translated protein into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 
appropriate secretion signals may be incorporated into the expressed pol)T3eptide. 
The signals may be endogenous to the polypeptide or they may be heterologous 
signals. 

Thus, the polypeptides of the invention may be expressed in a modified 
fomi, such as a fiision protein, and may include not only secretion signals but also 
additional heterologous functional regions. For instance, a region of additional 
amino adds, particularly charged amino acids, may be added to the N-terminus of 
the polypeptides to improve stability and persistence in the host cell, during 
purification or during subsequent handling and storage. Also, region(s) also may 
be added to the polypeptides to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to fecilitate purification, among others, are familiar and routine techniques in 
the art. A preferred fiision protein comprises a heterologous region fi-om 
immunoglobulin that is usefiil to solubilize polypeptides. For example, EP-A-0 
464 533 (Canadian counterpart 2045869) discloses fiision proteins comprising 
various portions of constant region of immunoglobin molecules together with 
another human protein or part thereof In many cases, the Fc part in fiision 
protein is thoroughly advantageous for use in therapy and diagnosis and thus 
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results, for example, in improved pharmacokinetic properties (EP-A 0232 262). 
On the other hand, for some uses it would be desirable to be able to delete the Fc 
part after the fusion protein has been expressed, detected and purified in the 
advantageous manner described. This is the case when Fc portion proves to be 
a hindrance to use in therapy and diagnosis, for example when the fusion protein 
is to be used as antigen for immunizations. In drug discovery, for example, human 
proteins, such as, hIL5- has been fiised with Fc portions for the purpose of high- 
throughput screening assays to identify antagonists of hIL-5. See, Bennett. D. et 
a/., Journal of Molecular Recognition *:52-58 (1995) and Johanson, K. et ai. 
The Journal of Biological Chemistry 270(16):9459-9471 (1995). 

The chitinase alpha or chitinase alpha-2 protein can be recovered and 
purified fi-om recombinant cell cultures by well-known methods including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance 
liquid chromatography ("HPLC") is employed for purification. 

Polypeptides of the present invention include naturally purified products, 
products of chemical synthetic procedures, and products produced by recombinant 
techniques from a prokaryotic or eukaryotic host, including, for example, 
bacterial, yeast, higher plant, insect and mammalian cells. Depending upon the 
host employed in a recombinant production procedure, the polypeptides of the 
present invention may be glycosylated or may be non-glycosylated. In addition, 
polypeptides of the invention may also include an initial modified methionine 
residue, in some cases as a result of host-mediated processes. 

Chitinase Alpha and Chitinase Alpha-! Polypeptides and Peptides 

The invention further provides an isolated chitinase alpha or chitinase 
alpha-2 polypeptide having the amino acid sequence encoded by the deposited 
cDNA, or the amino acid sequence in Figure 1 [SEQ ID N0:2], Figure 2 [SEQ 
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ID NO:4]. or Figure 5 [SEQ ID NO: 12] or a peptide or polypeptide comprising 
a portion of the above polypeptides. The terms "peptide" and "oligopeptide" are 
considered synonymous (as is commonly recognized) and each term can be used 
interchangeably as the context requires to indicate a chain of at least to amino 
acids coupled by peptidyl linkages. The word "polypeptide" is used herein for 
chains containing more than ten amino acid residues. All oligopeptide and 
polypeptide formulas or sequences herein are written from left to right and in the 
direction from amino terminus to carboxy terminus. 

It will be recognized in the art that some amino acid sequence of the 
chitinase alpha or chitinase alpha-2 polypeptide can be varied without significant 
effect of the structure or function of the protein. If such differences in sequence 
are contemplated, it should be remembered that there will be critical areas on the 
protein which determine activity. In general, it is possible to replace residues 
which form the tertiary structure, provided that residues performing a similar 
function are used. In other instances, the type of residue may be completely 
unimportant if the alteration occurs at a non-critical region of the protein. 

Thus, the invention further includes variations of the chitinase alpha and 
chitinase alpha-2 polypeptides which show substantial chitinase alpha or chitinase 
alpha-2 polypeptide activity. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions (for example, substituting one 
hydrophilic residue for another, but not strongly hydrophilic for strongly 
hydrophobic as a mle). Small changes or such "neutral" amino acid substitutions 
will generally have little effect on activity. 

Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino acids Ala, Val, Leu and He; interchange of the 
hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, 
substitution between the amide residues Asn and Gin, exchange of the basic 
residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. 

A particularly preferred variant of the chitinase alpha in Figure 1 [SEQ ID 
NO: 2] are those which result in an amino acid substitution of glutamic acid for 
isoleucine at amino acid position 176 (i.e., the nucleotides at positions 607 to 609 
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in Figure 1 are changed to either GAA or GAG). Figure 2 [SEQ ID N0:4] 
depicts this preferred sequence variant, which is expected to encode a peptide 
having chitinolytic activity . 

As indicated in detail above, further guidance concerning which amino acid 
5 changes are likely to be phenotypically silent (i.e., are not likely to have a 

significant deleterious effect on a function) can be found in Bowie, J.U., et al, 
^c/en^re 2^7:1306-1310 (1990). 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. A recombinantly 

10 produced version of the chitinase alpha or chitinase alpha-2 polypeptide can be 

substantially purified by the one-step method described in Smith and Johnson, 
Ge/ie 67:31-40 (1988). 

The polypeptides of the present invention include the polypeptide encoded 
by the deposited cDNA including the leader, the mature polypeptide encoded by 

15 the deposited the cDNA minus the leader (i.e., the mature protein), the 

polypeptide of Figure 1 [SEQ ID N0:2], Figure 2 [SEQ ID N0:4] or Figure 5 
[SEQ ID NO: 12] including the leader, the polypeptide of Figure 1 [SEQ ID 
N0:2], Figure 2 [SEQ ID NO:4] or Figure 5 [SEQ ID NO: 12] minus the leader, 
as well as polypeptides which have at least 90% similarity, more preferably at least 

20 95% similarity, and still more preferably at least 97%, 98% or 99% similarity to 

those described above. Further polypeptides of the present invention include 
polypeptides at least 80% identical, more preferably at least 90% or 95% identical, 
still more preferably at least 97%, 98% or 99% identical to the polypeptide 
encoded by the deposited cDNA, to the polypeptide of SEQ ED N0:2, SEQ ID 

25 N0:4, or SEQ ID NO: 12 and also include portions of such polypeptides with at 

least 30 amino acids and more preferably at least SO amino acids. 

By "% similarity" for two polypeptides is intended a similarity score 
produced by comparing the amino acid sequences of the two polypeptides using 
theBestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 

30 Genetics Computer Group, University Research Park, 575 Science Drive, 

Madison, WI 5371 1) and the default settings for determining similarity. Bestfit 
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uses the local homology algorithm of Smith and Waterman (Ad\fances in Applied 
Mathematics 2:482-489 (1981)) to find the best segment of similarity between 
two sequences. 

By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a chitinase alpha or 
chitinase alpha-2 polypeptide is intended that the amino acid sequence of the 
polypeptide is identical to the reference sequence «cept that the polypeptide 
sequence may include up to five amino acid alterations per each 100 amino acids 
of the reference amino acid of the chitinase alpha polypeptide. In other words, to 
obtain a polypeptide having an anuno acid sequence at least 95% identical to a 
reference amino acid sequence, up to 5% of the amino acid residues in the 
reference sequence may be deleted or substituted with another amino acid, or a 
number of amino acids up to 5% of the total amino acid residues in the reference 
sequence may be inserted into the reference sequence. These alterations of the 
reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one 
or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 
95%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
in Figure 1 [SEQ ID N0:2], Figure 2 [SEQ ID N0:4] or Figure 5 [SEQ ID 
NO: 121, or to the amino acid sequence encoded by deposited cDNA clone, can 
be determined conventionally using known computer programs such the Bestfit 
program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics 
Computer (jroup. University Research Park, 575 Science Drive, Madison, WI 
53711). When using Bestfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set, of 
course, such that the percentage of identity is calculated over the full length of the 
reference amino acid sequence and that gaps in homology of up to 5% of the total 
number of amino acid residues in the reference sequence are allowed. 
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As described in detail below, the polypeptides of the present invention can 
be used to raise polyclonal and monoclonal antibodies, which are useful in 
diagnostic assays for detecting chitinase alpha and/or chitinase alpha-2 protein 
expression as described below or as agonists and antagonists capable of enhancing 
5 or inhibiting protein function. Further, such polypeptides can be used in the yeast 

two-hybrid system to ^'capture" chitinase alpha or chitinase alpha-2 protein binding 
proteins which are also candidate agonist and antagonist according to the present 
invention. The yeast two hybrid system is described in Fields and Song, Nature 
340:245-246(1989). 

10 In another aspect, the invention provides a peptide or polypeptide 

comprising an epitope-bearing portion of a polypeptide of the invention. The 
epitope of this polypeptide portion is an immunogenic or antigenic epitope of a 
polypeptide of the invention. An "immunogenic epitope" is defined as a part of 
a protein that elicits an antibody response when the whole protein is the 

15 immunogen. These immunogenic epitopes are believed to be confined to a few 

loci on the molecule. On the other hand, a region of a protein molecule to which 
an antibody can bind is defined as an ''antigenic epitope.'' The number of 
immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, for instance, Geysen, H.M. et ai, Proc. NatL Acad Sci. (USA) 

20 57:3998-4002 (1984). 

As to the selection of peptides or polypeptides bearing an antigenic epitope 
(i.e., that contain a region of a protein molecule to which an antibody can bind), 
it is well known in that art that relatively short synthetic peptides that mimic part 
of a protein sequence are routinely capable of eliciting an antiserum that reacts 

25 with the partially mimicked protein. See, for instance, SutclifFe, J.G. et al. 

Science 279:660-666 (1983). Peptides capable of eliciting protein-reactive sera 
are frequently represented in the primary sequence of a protein, can be 
characterized by a set of simple chemical rules, and are confined neither to 
immunodominant regions of intact proteins (i.e., immunogenic epitopes) nor to the 

30 amino or carboxyl terminals. Peptides that are extremely hydrophobic and those 

of six or fewer residues generally are ineffective at inducing antibodies that bind 
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to the mimicked protein; longer, soluble peptides, especially those containing 
proline residues, usually are eflFective, Sutcliffe ei ai , supra, at 66 1 . For instance, 
18 of 20 peptides designed according to these guidelines, containing 8-39 residues 
covering 75% of the sequence of the influenza virus hemagglutinin HAl 
polypq)tide chain, induced antibodies that reacted with the HAl protein or intact 
virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. Thus, a high proportion of 
hybridomas obtained by fusion of spleen cells from donors immunized with an 
antigen epitope-bearing peptide generally secrete antibody reactive with the native 
protein. SutclifFe etaL, supra, at 663. The antibodies raised by antigenic epitope- 
bearing peptides or polypeptides are useful to detect the mimicked protein, and 
antibodies to different peptides may be used for tracking the fate of various 
regions of a protein precursor which undergoes posttranslation processing. The 
peptides and anti-peptide antibodies may be used in a variety of qualitative or 
quantitative assays for the mimicked protein, for instance in competition assays 
since it has been shown that even short peptides (e.g., about 9 amino acids) can 
bind and displace the larger peptides in immunoprecipitation assays. See, for 
instance, Wilson, LA. et al. Cell 37:161-11% (1984) at 777. The anti-peptide 
antibodies of the invention also are useful for purification of the mimicked protein, 
for instance, by adsorption chromatography using methods well known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
designed according to the above guidelines preferably contain a sequence of at 
least seven, more preferably at least nine and most preferably between about 15 
to about 30 ammo acids contained within the amino acid sequence of a 
polypeptide of the invention. However, peptides or polypeptides comprising a 
larger portion of an amino acid sequence of a polypeptide of the invention, 
containing about 30 to about 50 amino acids, or any length up to and including the 
entire amino acid sequence of a polypeptide of the invention, also are considered 
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epitope*bearing peptides or polypeptides of the invention and also are useful for 
inducing antibodies that react with the mimicked protein. Preferably, the amino 
acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic 

5 re^dues and highly hydrophobic sequences are prefi^abiy avoided); and sequences 

containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be 
used to generate chitinase alpha- or chitinase alpha-2-specific antibodies include: 
a polypeptide comprising amino acid residues from about 38 to about 51 in 

10 Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues from 

about 66 to about 85 in Figure 1 (SEQ ID N0:2); a polypeptide comprising amino 
acid residues from about 117 to about 127 in Figure 1 (SEQ ID N0:2); a 
polypeptide comprising amino acid residues from about 145 to about 153 in 
Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues from 

15 about 165 to about 172 in Figure I (SEQ ID N0:2); a polypeptide comprising 

amino add residues from about 177 to about 185 in Figure 1 (SEQ ID N0:2); a 
polypeptide comprising anuno acid residues from about 245 to about 254 in 
Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues from 
about 256 to about 274 in Figure 1 (SEQ ID N0:2); a polypeptide comprising 

20 amino acid residues from about 286 to about 293 in Figure 1 (SEQ ID N0:2); a 

polypeptide comprising amino acid residues from about 366 to about 376 in 
Figure 1 (SEQ ID N0:2); and a polypeptide comprising amino acid residues from 
about 394 to about 407 in Figure I (SEQ ID N0:2). 

The epitope-bearing peptides and polypeptides of the invention may be 

25 produced by any conventional means for making peptides or polypeptides 

including recombinant means using nucleic acid molecules of the invention. For 
instance, a short epitope-bearing amino acid sequence may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and 
purification, as well as during immunization to produce anti-peptide antibodies. 

30 Epitope-bearing- peptides also may be synthesized using known methods of 

chemical synthesis. For instance. Houghten has described a simple method for 
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synthesis of large numbers of peptides, such as 10-20 mg of 248 different 13 
residue peptides representing single amino acid variants of a segment of the HAl 
polypeptide which were prepared and characterized (by ELISA-type binding 
studies) in less than four weeks. Houghten, R. A.. Proc. Natl Acad ScL (USA) 
52:513 1-5135 (1985). This "Simultaneous Multiple Peptide Synthesis (SMPS)" 
process is further described in U.S. Patent No. 4,631,211 to Houghten et ai 
(1986). In this procedure the individual resins for the solid-phase synthesis of 
various peptides are contained in separate solvent-permeable packets, enabling the 
optimal use of the many identical repetitive steps involved in solid-phase methods. 
A completely manual procedure allows SOO-1000 or more syntheses to be 
conducted simultaneously. Houghten et al, supra, at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to 
induce antibodies according to methods well known in the art. See, for instance, 
Sutcliflfe et al. , supra; Wilson et al, supra. Chow, M. et aL , Proc. Nail Acad Sci 
(USA) «2:910-914; and Bittle, F.J. etai, I Gen, Virol 66:2347-2354 (1985). 
Generally, animals may be immunized with free peptide; however, anti-peptide 
antibody titer may be boosted by coupling of the peptide to a macromolecular 
carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, 
peptides containing cysteine may be coupled to carrier using a linker such as m- 
maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may 
be coupled to carrier using a more general linking agent such as glutaraldehyde. 
Animals such as rabbits, rats and mice are immunized with either free or carrier- 
coupled peptides, for instance, by intraperitoneal and/or intradermal injection of 
emulsions containing about 100 ^g peptide or carrier protein and Freund's 
adjuvant. Several booster injections may be needed, for instance, at intervals of 
about two weeks, to provide a useful titer of anti-peptide antibody which can be 
detected, for example, by ELISA assay using free peptide adsorbed to a solid 
surface. The titer of anti-peptide antibodies in serum from an immunized animal 
may be increased by selection of anti-peptide antibodies, for instance, by 
adsorption to the peptide on a solid support and elution of the selected antibodies 
according to methods well known in the art. 
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Immunogenic epitope-bearing peptides of the invention, i.e., those parts 
of a protein that elicit an antibody response when the whole protein is the 
inununogen, are identified according to methods known in the art. For instance, 
Geysen et al, 1984, supra, discloses a procedure for rapid concurrent synthesis 
on solid supports of hundreds of peptides of sufficient purity to react in an 
enzyme-linked immunosorbent assay. Interaction of synthesized peptides with 
antibodies is then easily detected without removing them from the support. In this 
manner a peptide bearing an inrununogenic epitope of a desired protein may be 
identified routinely by one of ordinary skill in the art. For instance, the 
immunologically important epitope in the coat protein of foot-and-mouth disease 
virus was located by Geysen et al with a resolution of seven amino acids by 
synthesis of an overtyping set of all 208 possible hexapeptides covering the entire 
213 amino acid sequence of the protein. Then, a complete replacement set of 
peptides in which all 20 amino acids were substituted in turn at every position 
within the epitope were synthesized, and the particular amino acids conferring 
specificity for the reaction with antibody were determined. Thus, peptide analogs 
of the epitope-bearing peptides of the invention can be made routinely by this 
method. U.S. Patent No. 4,708,781 to Geysen (1987) further describes this 
method of identifying a peptide bearing an immunogenic epitope of a desired 
protein. 

Further still, U.S. Patent No. 5,194,392 to Geysen (1990) describes a 
general method of detecting or determining the sequence of monomers (amino 
acids or other compounds) which is a topological equivalent of the epitope (i.e., 
a "mimotope") which is complementary to a particular paratope (antigen binding 
site) of an antibody of interest. More generally, U.S. Patent No. 4,433,092 to 
Geysen (1989) describes a method of detecting or determining a sequence of 
monomers which is a topographical equivalent of a ligand which is complementary 
to the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent 
No. 5,480,971 to Houghten, R.A. et al (1996) on Peralkylated Oligopeptide 
Mixtures discloses linear Ci-C7-alkyl peralkylated oligopeptides and sets and 
libraries of such peptides, as well as methods for using such oligopeptide sets and 



35 

libraries for determining the sequence of a perallcylated oligopeptide that 
preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs 
of the epitope-bearing peptides of the invention also can be made routinely by 
these methods. 

5 The entire disclosure of each document cited in this section on 

"Polypeptides and Peptides" is hereby incorporated herein by reference 

As one of skill in the art will appreciate, chitinase alpha and chitinase 
alpha-2 polypeptides of the present invention and the epitope-bearing fragments 
thereof described above can be combined with parts of the constant domain of 

1 0 immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 

facilitate purification and show an increased half-life in vivo. This has been 
shown, e.g., for chimeric proteins consisting of the first two domains of the human 
CD4-polypeptide and various domains of the constant regions of the heavy or light 
chains of mammalian inununoglobulins (EPA 394,827; Traunecker et aL, Nature 

1 5 i37:84- 86 (1988)), Fusion proteins that have a disulfide-linked dimeric structure 

due to the IgG part can also be more efficient in binding and neutralizing other 
molecules than the monomeric protein or protein firagment alone (Fountoulakis et 
ai, JBiochem 270:3958-3964 (1995)). 



Chromosome Assays 



20 The nucleic acid molecules of the present invention are also valuable for 

chromosome identification. The sequence is specifically targeted to and can 
hybridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence data 

25 (repeat polymorphisms) are presently available for marking chromosomal location. 

The mapping of DNAs to chromosomes according to the present invention is an 
important first step in correlating those sequences with genes associated with 
disease.. 
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In certain preferred embodiments in this regard, the cDNA herein disclosed 
is used to clone genomic DNA of a chitinase alpha protein gene. This can be 
accomplished using a variety of well known techniques and libraries, which 
generally are available commercially. The genomic DNA then is used for in situ 
chromosome mapping using well known techniques for this purpose. Typically, 
in accordance with routine procedures for chromosome mapping, some trial and 
error may be necessary to identify a genomic probe that gives a good in situ 
hybridization signal. 

In some cases, in addition, sequences can be mapped to chromosomes by 
preparing PCR primers (preferably 15-25 bp) from the cDNA. Computer analysis 
of the 3' untranslated region of the gene is used to rapidly select primers that do 
not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the primer wilt yield an amplified 
portion. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular DNA to a particular chromosome. Using the present invention with the 
same oligonucleotide primers, sublocalization can be achieved with panels of 
portions from specific chromosomes or pools of large genomic clones in an 
analogous manner. Other mapping strategies that can similarly be used to map to 
its chromosome include in situ hybridization, prescreening with labeled flow- 
sorted diromosomes and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Fluorescence in situ hybridization ("FISH") of a cDNA clone to a 
metaphase chromosomal spread can be used to provide a precise chromosomal 
location in one step. This technique can be used with probes from the cDNA as 
short as 50 or 60 bp. For a review of this technique, see Verma et al. Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York 
(1988). 
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Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man^ available on-line through Johns Hopkins University, Welch 
Medical Library. The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or genomic 
sequence between affected and unaffected individuals. If a mutation is observed 
in some or all of the afifected individuals but not in any normal individuals, then the 
mutation is likely to be the causative agent of the disease. 

With current resolution of physical mapping and genetic mapping 
techniques, a cDNA precisely localized to a chromosomal region associated with 
the disease could be one of between 50 and 500 potential causative genes. (This 
assumes 1 megabase mapping resolution and one gene per 20 kb). 

Identification of Receptor for the Chitinase Alpha or Chitinase Alpha-! 
Ligand 

This invention fiirther provides a method for identification of the receptor 
for the chitinase alpha or chitinase alpha-2 ligand. The gene encoding the receptor 
can be identified by numerous methods known to those of skill in the art, for 
example, ligand panning and FACS sorting (Coligan, et ai, Current Protocols in 
Immun. 1(1), Chapter 5, (1991)). Preferably, expression cloning is employed 
wherein polyadenylated RNA is prepared from a cell responsive to the chitinase 
alpha or chitinase alpha-2 ligand, and a cDNA library created firom this RNA is 
divided into pools and used to transfect COS cells or other cells that are not 
responsive to the chitinase alpha or chitinase alpha-2 ligand. Transfected cells 
which are grown on glass slides are exposed to labeled chitinase alpha or chitinase 
alpha-2 ligand, wWch can be labeled by a variety of means including iodination or 
inclusion of a recognition site for a site-specific protein kinase. Following fixation 
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and incubation, the slides are subjected to auto-radiographic analysis. Positive 
pools are identified and sub-pools are prepared and re-transfected using an 
iterative sub-pooling and re-screening process, eventually yielding a single clone 
that encodes the putative receptor. 

As an alternative approach for receptor identification, labeled ligand can 
be photoaflSnity linked with cell membrane or extract preparations that express the 
receptor molecule. Cross-linked material is resolved by PAGE and exposed to X- 
ray film. The labeled complex containing the ligand-receptor can be excised, 
resolved into peptide fi-agments, and subjected to protein microsequencing. The 
amino acid sequence obtained from microsequencing would be used to design a 
set of degenerate oligonucleotide probes to screen a cDNA library to identify the 
gene encoding the putative receptor. 

The method for determining whether a ligand can bind to the chitinase 
alpha or chidnase alpha-2 receptor comprises transfecting a cell population (one 
presumed not to contain the receptor) with the appropriate vector expressing the 
chitinase alpha or chitinase alpha-2 receptor, such that the cell will now express 
the recq)tor. A suitable response system is obtained by transfection of the DNA 
into a suitable host containing the desired second messenger pathways including 
cAMP, ion channels, phosphoinositide kinase, or calcium response. Such a 
transfection system provides a response system to analyze the activity of various 
ligands exposed to the cell. Ligands chosen could be identified through a rational 
^proach by taking known ligands that interact with similar types of receptors or 
using small molecules, cell supematants or extracts or natural products. 

The present invention fiirther provides a method of screening drugs to 
identify those which block (antagonists) interaction of ligand to receptor. A 
chitinase alpha or chitinase alpha-2 antagonist is a compound which eliminates or 
reduces the natural biological functions of chitinase alpha or chitinase alpha-2. As 
an example, a mammalian cell or membrane preparation expressing the chitinase 
alpha or chitinase alpha-2 receptor would be incubated with labeled ligand in the 
presence of the drug. The ability of the drug to block this interaction could then 
be measured. Alternatively, the response of a known second messenger system 
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following interaction of ligand and receptor would be measured and compared in 
the presence or absence of the dmg. Such second messenger systems include but 
are not limited to, cAMP guanylate cyclase, ion channels or phosphoinositide 
hydrolysis. 

5 Potential antagonists include an antibody, or in some cases, an 

oligopeptide, which binds to the polypeptide. Alternatively, a potential antagonist 
may be a closely related protein which binds to the receptor sites, however, they 
are inactive forms of the polypeptide and thereby prevent the action of chitinase 
alpha or chitinase alpha'2 since receptor sites are occupied. 

10 Chitinase alpha or chitinase alpha-2 can also be used in the production of 

antibodies. The polypeptides, their fragments or other derivatives, or analogs 
thereof, or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or 
monoclonal antibodies. The present invention also includes chimeric, single chain, 

15 and humanized antibodies, as well as Fab fragments, or the product of an Fab 

expression library. Various procedures known in the art may be used for the 
production of such antibodies and fragments. 

Antibodies generated against the polypeptides coiresponding to a sequence 
of the present invention can be obtained by direct injection of the polypeptides into 

20 an animal or by administering the polypeptides to an animal, preferably a 

nonhuman. The antibody so obtained will then bind the polypeptides itself In this 
manner, even a sequence encoding only a fragment of the polypeptides can be 
used to generate antibodies binding the whole native polypeptides. Such 
antibodies can then be used to isolate the polypeptide from tissue expressing that 

25 polypeptide. 

For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples 
include the hybridoma technique (Kohler and Milstein, Nature 25^:495-497 
(1975)), the trioma technique, the human B-cell hybridoma technique (Kozbor et 

30 a/,. Immunology Today 4:12 (1983)), and the EBV-hybridoma technique to 
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produce human monoclonal antibodies (Colee/ a/., in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc. (1985), pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to 
immunogenic chitinase alpha or chitinase alpha-2 products of this invention. Also, 
transgenic mice may be used to express humanized antibodies to immunogenic 
chitinase alpha or chitinase alpha-2 products of this invention. 

Another potential antagonist is an antisense construct prepared using 
antisense technology. Antisense technology can be used to control gene 
expression through triple-helix formation or antisense DNA or RNA, both of 
which methods are based on binding of a polynucleotide to DNA or RNA. For 
example, the 5' coding portion of the polynucleotide sequence, which encodes for 
the mature polypeptides of the present invention, is used to design an antisense 
RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA 
oligonucleotide is designed to be complementary to a region of the gene involved 
in transcription (triple helix-see Lee et ai, Nucl Acids Res. 6:3073 (1979); 
Cooney et ai. Science 241:456 (1988); and Dervan et al. Science 257:1360 
(1991)), thereby preventing transcription and the production of chitinase alpha. 
The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks 
translation of the mRNA molecule into chitinase alpha or chitincase alpha-2 
polypeptide (Antisense - Okano, J. Neurochem, 5(5:560 (1991); 
Oligodeoxynucleoiides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). The oligonucleotides described above can also be 
delivered to cells such that the antisense RNA or DNA may be expressed in vivo 
to inhibit production of chitinase alpha or chitinase alpha-2. 

Potential antagonists also include a small molecule which binds to and 
occupies the catalytic site of the polypeptide thereby making the catalytic site 
inaccessible to substrate such that normal biological activity is prevented. 
Examples of small molecules include, but are not limited to, small peptides or 
peptide-like molecules. 
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Therapeutic Applications of Chitinase Alpha, Chitinase Alpha-! and 
Antagonists Thereof 



Treatment of pathological conditions characterized by elevated 
plasma levels of chitinase alpha or chitinase alpha-! 



5 The antagonists, described above, may be employed to treat rheumatoid 

arthritis, atherosclerosis, inflammation, or other tissue remodeling disease 
processes, that may be characterized by elevated plasma levels of chitinase alpha 
and/or chitinase alpha-2. The antagonists may be employed in a composition with 
a pharmaceutically acceptable carrier, e.g., as hereinafter described, 

10 It is believed that human chitotriosidase is involved in tissue remodeling. 

Human chitotriosidase is homologous to microbial chitinases. In particular, the 
protein shares a similar cysteme motif and homology to the active site of microbial 
chitinases. Accordingly, chitotriosidase can be used in the development of 
treatments of tissue remodeling diseases and in the diagnosis of these diseases. 

15 Similarly, human chitinase alpha or chitinase alpha-2 can be used the same way. 

It is believed that human serum chitotriosidase is an extracellular matrix 
remodeling factor involved in the remodeling process occurring in diseases. 
Activity against chitotriosidase substrates is indicative of a function in degrading 
extracellular matrix substrates with a similar carbohydrate structure to chitin. 

20 Chitotriosidase was found during morphological differentiation of monocytes into 

macrophages. These macrophages can be associated with different sites of 
inflammation that occur in various disease states, including but not limited to 
atherosclerosis, rheumatoid arthritis, acute inflanunatory disease, and inflammation 
due to injury. 

25 Accordingly, it is believed that human serum chitotriosidase, and now 

chitinase alpha and chitinase aIpha-2, are useful targets for the inhibition of tissue 
remodeling disease processes, Purified human chitinase alpha or chitinase alpha-2 
protein could thus be tested for various in vitro assays for tissue remodeling to 
determine the extent to which this protein promotes these processes. Therapeutics 

30 would be defined as molecules that specifically inhibit chitinase alpha or chitinase 
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alpha-2 function as it relates to tissue remodeling activity in these assays and 
includes neutralizing antibodies as described above. The invention also relates to 
methods of treating a patient with a tissue remodeling disorder by administering 
a therapeutically effective amount of a chitinase alpha or chitinase alpha-2 
5 antagonist or antibody. 

Treatment of fungal infection 

In addition, the invention relates to methods of treating fungal infection by 
administering a therapeutically effective amount of chitinase alpha or chitinase 
alpha-2 to a patient. Chitinases expressed in human tissues are expected to confer 
10 protection against fungi since chitin is a component of the fiingal cell wall, and 

chitinase cleaves this polymer. Preferably, the chitinase alpha variant having an 
amino acid sequence shown in Figure 2 (SEQ ID NO: 4) is used for treating 
fungal infection. 

Modes of administration 

15 Given the activities modulated by chitinase alpha and chitinase alpha-2, it 

is readily apparent that a substantially altered (increased or decreased) level of 
expression of these proteins in an individual compared to the standard or "normal" 
level produces pathological conditions such as those described above. It will also 
be appreciated by one of ordinary skill that, since the chitinase alpha and chitinase 

20 alpha-2 proteins of the invention are translated with a leader peptide suitable for 

secretion of the mature protein from cells, when chitinase alpha or chitinase alpha- 
2 protein (particularly the mature form) is added fi-om an exogenous source to 
cells, tissues or the body of an individual, the protein will exert its modulating 
activities on any of its target cells. Therefore, it will be appreciated that conditions 

25 caused by a decrease in the standard or normal level of chitinase alpha or chitinase 

alpha-2 activity in-an individual, can be treated by administration of these proteins. 
Thus, the invention further provides a method of treating an individual in need of 
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an increased level of chitinase alpha or chitinase alpha-2 activity comprising 
administering to such an individual a pharmaceutical composition comprising an 
amount of an isolated chitinase alpha and/or chitinase aIpha-2 polypeptide of the 
invention, particularly the mature form, effective to increase the chitinase alpha 
and/or chitinase alpha-2 activity level in such an individual. 

The chitninase alpha and/or chitinase polypeptide composition to be used 
in the therapy will be formulated and dosed in a fashion consistent with good 
medical practice, taking into account the clinical condition of the individual patient 
(especially the side effects of treatment with chitinase alpha polypeptide alone), 
the site of delivery of the composition, the method of administration, the 
scheduling of administration, and other factors known to practitioners. The 
"eflFective amount" of chitinase alpha or chitinase alpha-2 polypeptide for purposes 
herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of 
chitinase alpha or chitinase aIpha-2 polypeptide administered parenteraily per dose 
will be in the range of about 1 fig/kg/day to 1 0 mg/kg/day of patient body weight, 
although, as noted above, this will be subject to therapeutic discretion. More 
preferably, this dose is at least 0.01 mg/kg/day, and most preferably for humans 
between about 0.01 and 1 mg/kg/day for the hormone. If given continuously, the 
chitinase alpha or chitinase alpha-2 polypeptide is typically administered at a dose 
rate of about 1 ng/kg/hour to about 50 ng/kg/hour, either by 1-4 injections per 
day or by continuous subcutaneous infusions, for example, using a mini-pump. An 
intravenous bag solution may also be employed. 

The chitinase alpha or chitinase alpha-2 polypeptide is also suitably 
administered by sustained-release systems. Suitable examples of sustained-release 
compositions include semi-permeable polymer matrices in the form of shaped 
articles, e.g., films, or mirocapsules. Sustained-release matrices include 
polylactides (U.S. Pat. No. 3,773,919, EP 58,481), copolymers of L-glutamic acid 
and gamma-ethyl-L-glutamate (Sidman, U. et aL, Biopolymers 22:547-556 
(1983)), poly (2- hydroxyethyl methacrylate) (R. Langer etaL, 1 Biomed Mater 
Res, 15.\(>1'211 (1981). and R. Langer, Chem. Teck 72:98-105 (1982)), ethylene 
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vinyl acetate (R. Langer et ai, Id.) or poly-D- (-)-3-hydroxybutyric acid (EP 
133,988). Sustained-release chitinase alpha or chitinase alpha-2 polypeptide 
compositions also include liposonialiy entrapped polypeptides. Liposomes 
containing chitinase alpha or chitinase alpha-2 polypeptide are prepared by 
5 methods known per se\ DE 3,218,121; Epstein ei al, Proc. Nati Acad ScL 

(USA) 52:3688-3692 (1985); Hwang et al, Proc. Natl Acad ScL (USA) 
77:4030^034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos, 4,485,045 and 
4,544,545; and EP 102,324. Ordinarily, the liposomes are of the small (about 

10 200-800 Angstroms) unilamellar type in which the lipid content is greater than 

about 30 mol. percent cholesterol, the selected proportion being adjusted for the 
optimal chitinase alpha or chitinase alpha-2 polypeptide therapy. 

For parenteral administration, in one embodiment, the chitinase alpha or 
chitinase alpha-2 polypeptide is formulated generally by mixing it at the desired 

15 degree of purity, in a unit dosage injectable form (solution, suspension, or 

emulsion), with a pharmaceutically acceptable carrier, i.e., one that is non-toxic 
to recipients at the dosages and concentrations employed and is compatible with 
other ingredients of the formulation. For example, the formulation preferably does 
not include oxidizing agents and other compounds that are known to be 

20 deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably 
the carrier is a parenteral carrier, more preferably a solution that is isotonic with 

25 the blood of the recipient. Examples of such carrier vehicles include water, saline, 

Ringer's solution, and dextrose solution. Non-aqueous vehicles such as fixed oils 
and ethyl oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 

30 toxic to recipients at the dosages and concentrations employed, and include 

buffers such as phosphate, citrate, succinate, acetic acid, and other organic acids 
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or their salts; antioxidants such as ascorbic acid; low molecular weight (less than 
about ten residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such 
as s rum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, glutanuc acid, aspartic acid, 
5 or arginine; monosaccharides, disaccharides, and other carbohydrates including 

cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as 
EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; 
and/or nonionic surfactants such as polysorbates, poloxamers, or PEG. 

The chitinase alpha and chitinase aipha-2 polypeptides are typically 

10 formulated in such vehicles at a concentration of about 0. 1 mg/ml to 100 mg/ml, 

preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be understood that the use 
of certain of the foregoing excipients, carriers, or stabilizers will result in the 
formation of polypeptide salts. 

Polypeptides to be used for therapeutic administration must be sterile. 

15 Sterility is readily accomplished by filtration through sterile filtration membranes 

(e.g., 0.2 micron membranes). Therapeutic chitinase alpha and chitianse aipha-2 
polypeptide compositions generally are placed into a container having a sterile 
access port, for example, an intravenous solution bag or vial having a stopper 
pierceabie by a hypodermic injection needle. 

20 Chitinase alpha and chitinase alpha-2 polypeptides ordinarily will be stored 

in unit or multi-dose containers, for example, sealed ampoules or vials, as an 
aqueous solution or as a lyophilized formulation for reconstitution. As an example 
of a lyophilized formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% 
(w/v) aqueous chitinase alpha polypeptide solution, and the resulting mixture is 

25 lyophilized. The infiision solution is prepared by reconstituting the lyophilized 

chitinase alpha polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one 
or more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such container(s) can be a notice 

30 in the fomi prescribed by a governmental agency regulating the manufacture, use 

or sale of pharmaceuticals or biological products, which notice reflects approval 
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by the agency of manufacture, use or sale for human administration. In addition, 
the polypeptides of the present invention may be employed in conjunction with 
other therapeutic compounds. 

Example 1 

5 Bacterial Expression and Purification of Chitinase Alpha 

The DNA sequence encoding chitinase alpha, ATCC No. 97529, is initially 
amplified using PCR oligonucleotide primers corresponding to the 5* sequences 
of the processed chitinase alpha protein (minus the signal peptide sequence) and 
the vector sequences 3' to the gene. Additional nucleotides containing restriction 

10 sites to facilitate cloning are added to the 5' and 3* sequences respectively. 

The 5* oligonucleotide primer has the sequence 
CGCGGATCCATGGGAGCAACCACCATGGACCAG (SEQ ID NO: 5) and 
contains a BaimHI restriction site followed by 24 nucleotides of chitinase alpha 
coding sequence starting from the presumed terminal amino acid of the processed 

1 5 protein codon. 

The 3* primer sequence CGCGGATCCGCTTCTCTGTAAGTTAATCCT 
(SEQ ID N0:6) contains complementary sequences to the BamHI site and is 
followed by 21 nucleotides of chitinase-alpha DNA insert. The restriction sites 
correspond to the restriction enzyme sites on the bacterial expression vector pQE- 

20 9. (Qiagen, Inc. 9259 Eton Avenue, Chatsworth, CA, 91311). pQE-9 encodes 

antibiotic resistance (Amp'), a bacterial origin of replication (ori), an IPTG- 
regulatable promoter operator (P/0), a ribosome binding site (RBS), a 6-His tag 
and restriction enzyme sites. pQE-9 is then digested with BamHI. 

The amplified sequences are ligated into pQE-9 and inserted in frame with 

25 the sequence encoding for the histidine tag and the RBS. The ligation mixture is 

then used to transform the E. coli strain available from Qiagen under the 
trademark M157rep 4 by the procedure described in Sambrook, J. et ai. 
Molecular Cloning: A Laboratory Manual, Cold Spring Laboratory Press, 
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(1989). M15/rep4 contains multiple copies of the plasmid pREP4, which 
expresses the lad repressor and also confers kanamycin resistance (KanO- 
Transformants are identified by their ability to grow on LB plates and 
ampicillin/kanamycin resistant colonies were selected. Plasmid DNA is isolated 
and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (0/N) in 
liquid culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 
fig/ml). The 0/N culture is used to inoculate a large culture at a ratio of 1 : 100 to 
1 :250. The cells were grown to an optical density 600 (O.D.*^) of between 0.4 
and 0.6. IPTG ("Isopropyl-B-D-thiogalacto pyranoside") is then added to a final 
concentration of 1 mM, EPTG induces by inactivating the lad repressor, clearing 
the P/0 leading to increased gene expression. Cells are grown an extra 3 to 4 
hours. Cells were then harvested by centrifijgation. The cell pellet is solubilized 
in the chaotropic agent 6 Molar Guanidine HCI. After clarification, solubilLzed 
chitinase alpha is purified fi'om this solution by chromatography on a Nickel- 
Chelate column under conditions that allow for tight binding by proteins 
containing the 6-His tag (Hochuli, E. et ai, 1 Chromatography 4 J 1:177 ASA 
(1984)). Chitinase-alpha (>90% pure) is eluted fi-om the column in 6 molar 
guanidine HCI pH 5.0 and for the purpose of renaturation adjusted to 3 molar 
guanidine HCI, lOOmM sodium phosphate, 10 mmolar glutathione (reduced) and 
2 mmolar glutathione (oxidized). After incubation in this solution for 12 hours, 
the protein is dialyzed to 10 mmolar sodium phosphate. 

Example 2 

Qaning and Expression of Chitinase Alpha in Baculovirus 

The DNA sequence encoding the full length chitinase alpha protein, ATCC 
No. 97529, is amplified using PCR oligonucleotide primers corresponding to the 
5' and 3' sequences of the gene: 

The 5' primer has the sequence 
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CGCGGATCCGCCATCATGGGAGCAACCACCATG (SEQ ID NO: 7) and 
contains a BamHI restriction site followed by 6 nucleotides resembling an 
efficient signal for the initiation of tratislation in eukaryotic cells (Kozak, M., J. 
MoL Biol 796:947-950 (1987) which is just behind the first 280 nucleotides of the 
chitinase-alpha gene (the initiation codon for translation "ATG" is underlined). 

The 3* primer has the sequence 
CGCGGATCCGCTTCTCTGTAAGTTAATCCT (SEQ ID NO: 8) and contains 
the cleavage site for the restriction endonuclease BamHI and 21 nucleotides 
complementary to the 3' non-translated sequence of the chitinase-alpha gene. The 
amplified sequences are isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La JoUa, Ca.). The fragment is then 
digested with the endonuclease BamHI, and then purified again on a 1% agarose 
gel This fi-agment is designated F2. 

The vector pA2 (modification of pVL941 vector, discussed below) is used 
for the expression of the chitinase alpha protein using the baculovirus expression 
system (for review see: Summers, M.D. and Smith, G.E. A manual of methods for 
baculovirus vectors and insect cell culture procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555 (1987)). This expression vector contains 
the strong polyhedrin promoter of the Autographa califomica nuclear polyhedrosis 
virus (AcMNPV) followed by the recognition sites for the restriction endonuclease 
BamHI. The polyadenylation site of the simian vims (SV)40 is used for efficient 
polyadenylation. For an easy selection of recombinant vinis the beta-galactosidase 
gene fi-om E,coli is inserted in the same orientation as the polyhedrin promoter 
followed by the polyadenylation signal of the polyhedrin gene. The polyhedrin 
sequences are flanked at both sides by viral sequences for the cell-mediated 
homologous recombination of co-transfected wild-type viral DNA. Many other 
baculovirus vectors could be used in place of pRGl such as pAc373, pVL941 and 
pAcIMl (Luckow, V.A. and Summers, M.D., Virology 17031-39 ). 

The plasmid is digested with the restriction en2yme BamHI and then 
dephosphorylated using calf intestinal phosphatase by procedures known in the 
art. The DNA is then isolated from a 1% agarose gel using the commercially 
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available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). This vector DNA is 
designated V2. 

Fragment F2 and the dephosphorylated plasnud V2 is ligated with T4 
DNA ligase. Kcoli HBlOl cells were then transformed and bacteria identified 
that contained the plasmid (pBac chitinase alpha) with the chitinase-alpha gene 
using the enzyme BanJHDl. The sequence of the cloned firagment is confirmed by 
DNA sequencing. 

Five/ig of the plasmid pBacchitinase alpha is co-transfected with 1.0 /zg 
of a commercially available linearized baculovirus ("BaculoGold™ baculovirus 
DNA", Phamiingen, San Diego, CA.) using the lipofection method (Feigner et al, 
Proc. NatL Acad Scl (USA) 5^:7413-7417 (1987)). 

One/zg of BaculoGold™ virus DNA and 5 //g of the plasmid pBac 
chitinase alpha are mbced in a sterile well of a microtiter plate containing 50 /il of 
serum firee Grace's medium (Life Technologies Inc., Gaithersburg, MD). 
Afterwards 10 ^\ Lipofectin plus 90 ^1 Grace's medium are added, mixed and 
incubated for 15 minutes at room temperature. Then the transfection mixture is 
added drop- wise to the Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is rocked 
back and forth to mix the newly added solution. The plate is then incubated for 
5 hours at 2TC, After 5 hours, the transfection solution is removed fi-om the 
plate and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum 
is added. The plate is put back into an incubator and cultivation continued at 
27'*C for four days. 

After four days, the supernatant is collected and a plaque assay is 
performed similar to the assay described by Sunmiers and Smith (supra). As a 
modification, an agarose gel with "Blue Gal" (Life Technologies Inc., 
Gaithersburg) is used, which allows an easy isolation of blue stained plaques. (A 
detailed description of a "plaque assay" can also be found in the user's guide for 
insect cell culture and baculovirology distributed by Life Technologies Inc., 
Gaithersburg, page 9-10). 
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Four days after the serial dilution, the virus is added to the cells, and blue 
stained plaques are picked with the tip of an Eppendorf pipette. The agar 
containing the recombinant viruses is then resuspended in an Eppendorf tube 
containing 200 of Grace*s medium. The agar is removed by a brief 
S centrifugation and the supernatant containing the recombinant baculovirus was 

used to infect Sf9 cells seeded in 35 mm dishes. Four days later the supematants 
of these culture dishes are harvested and then stored at 4°C. 

Sf9 cells were grown in Grace*s medium supplemented with 10% heat- 
inactivated FBS. The cells were infected with the recombinant baculovirus V- 

10 chitinase-alpha at a multiplicity of infection (MOI) of 2. Six hours later, the 

medium is removed and replaced with SF900 II medium minus methionine and 
cysteine (Life Technologies Inc., Gaithersburg). 42 hours later 5 izCi of ^'S- 
methionine and 5 /zCi "S cysteine (Amersham) are added. The cells are further 
incubated for 16 hours before they are harvested by centrifugation and the labelled 

15 proteins visualized by SDS-PAGE and autoradiography. 

Example 3 

Cloning and Ejq>ression in Mammalian Cells 



Most of the vectors used for the transient expression of the chitinase alpha 
protein gene sequence in mammalian cells should carry the SV40 origin of 
20 replication. This allows the replication of the vector to high copy numbers in cells 

(e.g. COS cells) which express the T antigen required for the initiation of viral 
DNA synthesis. Any other mammalian cell line can also be utilized for this 
purpose. 

A typical mammalian expression vector contains the promoter element, 
25 which mediates the initiation of transcription of mRNA, the protein coding 

sequence, and signals required for the termination of trancription and 
polyadenylation of the transcript. Additional elements include enhancers, Kozak 
sequences and intervening sequences flanked by donor and acceptor sites for RNA 
splicing. Highly efficient transcription can be achieved with the early and late 
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promoters from SV40, the long terminal repeats (LTRs) from Retroviruses, e.g. 
RSV, FTTLVI, HIVI and the early promoter of the cytomegaloviojs (CMV). 
However, cellular signals can also be used (e.g. human actin promoter). Suitable 
expression vectors for use in practicing the present invention include, for example, 
vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), pRSVcat 
(ATCC 37152), pSV2dhfr (ATCC 37146) and pBC12KD (ATCC 67109). 
Mammalian host cells that could be used include, human Hela, 283, H9 and 
Jurkart, cells, mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CVl, African 
green monkey cells, quail QCl-3 cells, mouse L cells and Chinese hamster ovary 
cells. 

Alternatively, the gene can be expressed in stable cell lines that contain the 
g©ie integrated into a chromosome. The co-transfeaion with a selectable marker 
such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation of 
the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) is a useful marker to 
develop cell lines that carry several hundred or even several thousand copies of the 
gene of interest. Another usefijl selection marker is the enzyme glutamine synthase 
(GS) (Murphy e/ a/., Biochem J. 227:277-279 (1991); Bebbington et al, 
Bio/Technology 70:169-175 (1992)). Using these markers, the mammalian cells 
are grown in selective medium and the cells with the highest resistance are 
selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) cells are often used for the 
production of proteins. 

The expression vectors pCl and pC4 contain the strong promoter (LTR) 
of the Rous Sarcoma Virus (Cullen et ai. Molecular and Cellular Biology, 
438-4470 (March, 1985)) plus a fragment of the CMV-enhancer (Boshart et ai. 
Cell -^7:521-530 (1985)). Multiple cloning sites, e.g. with the restriction enzyme 
cleavage sites BamHI, Xbal and Asp718, facilitate the cloning of the gene of 
interest.. The vectors contain in addition the 3' intron, the polyadenylation and 
termination signal of the rat preproinsulin gene. 



52 

Example 3(a) Expression in COS Cells 

The expression plasmid, chitinase alpha HA, is made by cloning a cDNA 
encoding chitinase alpha into the expression vector pcDNAI/Amp (which can be 
obtained from Invitrogen, Inc.). 

The expression vector pcDNAI/amp contains: (1) an Rcoli origin of 
replication eflFective for propagation in £. coli and other prokaryotic cells; (2) an 
ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) 
an SV40 origin of replication for propagation in eukaryotic cells; (4) a CMV 
promoter, a polylinker, an S V40 intron, and a polyadenylation signal arranged so 
that a cDNA conveniently can be placed under expression control of the CMV 
promoter and operably linked to the SV40 intron and the polyadenylation signal 
by means of restriction sites in the polylinker, 

A DNA fragment encoding the entire chitinase alpha precursor and an HA 
tag fused in frame to its 3* end is cloned into the polylinker region of the vector so 
that recombinant protein expression is directed by the CMV promoter. The HA 
tag corresponds to an epitope derived from the influenza hemagglutinin protein 
described by Wilson et aL, Ceil 37:161 (1984). The fusion of the HA tag to the 
target protein allows easy detection of the recombinant protein with an antibody 
that recognizes the HA epitope. 

The plasmid construction strategy is as follows. 

The chitinase alpha cDNA of the deposited clone is amplified using primers 
that contain convenient restriction sites, much as described above regarding the 
construction of expression vectors for expression of chitinase alpha in E, coli. To 
facilitate detection, purification and characterization of the expressed chitinase 
alpha, one of the primers contains a hemagglutinin tag ("HA tag") as described 
above. 

Suitable primers include that following, which are used in this example. 
The 5* primer, containing the underiined BamHI site, an AUG start codon and 6 
codons of the 5' coding region has the following sequence: 
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5* CGC GGA TCC GCC ATC ATG GGA GCA ACC ACC ATG (SEQ ID 
NO: 9) 

The 3* primer, containing complementary sequences to the BamHI site, a 
stop codon, 9 codons thereafter forming the hemagglutinin HA tag, and 18 bp of 
3' coding sequence (at the 3' end) has the following sequence: 
5' CGC GGA TCC TCA AGC GTA GTC TGG GAC GTC GTA TGG GTA 
CAG GGA GCC AAG GCT TCT (SEQ ID NO: 10) 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 
digested with Ban>HI and then ligated. The ligation mixture is transformed into 
£. coli strain SURE (available from Stratagene Cloning Systems, 1 1099 North 
Torr^ Pines Road, La Jolia, CA 92037), and the transformed culture is plated on 
ampicillin media plates which then are incubated to allow growth of ampicillin 
resistant colonies. Plasmid DNA is isolated from resistant colonies and examined 
by restriction analysis and gel sizing for the presence of the chemokine alpha- 
encoding fragment. 

For expression of recombinant chemokine alpha, COS cells are transfected 
with an expression vector, as described above, using DEAE*DEXTRAN, as 
described, for instance, in Sambrook, J. et al. Molecular Cloning: A Laboratory 
Manual, Cold Spring Laboratory Press, Cold Spring Harbor, New York (1989). 
Cells are incubated under conditions for expression of chemokine alpha by the 
vector. 

Expression of the chemokine alpha HA fusion protein is detected by 
radiolabelling and immunpprecipitation, using methods described in, for example 
Hariow et ai, Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1988). To this end, two days 
after transfection, the cells are labeled by incubation in media containing "S- 
cysteine for 8 hours. The cells and the media are collected, and the cells are 
washed and the lysed with detergent-containing RIP A buffer: 150 mM NaCl, 1% 
NP-40, 0. 1% SDS, 1% NP.40, 0.5% DOC, 50 mM TRIS, pH 7.5, as described 
by Wilson et al cited above. Proteins are precipitated from the cell lysate and 
from the culture media using an HA-specific monoclonal antibody. The 
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precipitated proteins then are analyzed by SDS-PAGE gels and autoradiography 
An expression product of the expected size is seen in the cell lysate, which is not 
seen in negative controls. 

Example 3(b): Cloning and Expression in CHO Cells 

5 The DNA sequence encoding chitinase alpha protein is amplified using 

PGR oligonucleotide primers specific to the amino terminal sequence of the 
protein and to vector sequences 3' to the gene. Additional nucleotides containing 
restriction sites to facilitate cloning are added to the 5' and 3' sequences 
respectively. The restrictions sites are convenient to restriction enzyme sites in the 

1 0 CHO expression vector PC4 . 

The amplified chitinase alpha protein DNA and the vector PC4 both are 
digested with BamHI and the digested DNAs then ligated together. Insertion of 
the chitinase alpha protein DNA into the BamHI restricted vector places the 
protein coding region downstream of and operably linked to the vector's 

15 promoter. The ligation mixture is transformed into competent CHO cells using 

standard procedures as described, for example, in Sambrook et al, 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.; Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). 
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Example 4 

Expression Pattern of Ckitinase Alpha in Human Tissue 

Northern blot analysis is carried out to examine chitinae alpha gene 
expression in human tissues^ using methods described by, among others, 
Sambrook et al, cited above. A cDNA probe containing the entire nucleotide 
sequence of the chitinase alpha protein (SEQ ID N0:1) is labeled with '^P using 
the reJ/prime™ DNA labeling system (Amersham Life Science), according to 
manufacturer's instructions. After labelling, the probe is purified using a 
CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), according to 
manufacturer's protocol number PT 1200-1. The purified labelled probe is then 
used to examine various human tissues for chitinase alpha mRNA. 

Multiple Tissue Northern (MTN) blots containing various human tissues 
(H) or human immune system tissues (IM) are obtained from Clontech and are 
examined with labelled probe using ExpressHyb™ hybridization solution 
(Clontech) according to manufacturer's protocol number PTl 190-1. Following 
hybridization and washing, the blots are mounted and exposed to film at -70'' C 
overnight, and films developed according to standard procedures. 

Numerous modifications and variations of the present invention are 
possible in light of the above teachings and, therefore, within the scope of the 
appended claims, the invention may be practiced otherwise than as particularly 
described. 

The disclosures of all patents, patent applications and publications referred 
to herein are hereby incorporated by reference. 
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SEQUENCE LISTING 



(1) GENER?U. INFORMATION: 

(i) APPLICANT: HUTVIAN GENOME SCIENCES, INC. 

9410 KEY WEST AVENUE 
ROCKVILLE, MD 208 50 
APPLICANT/ INVENTORS: THOTAKURA, NAGESWARA RAO 

CHOPRA, ARVIND 
CHOI, GIL 
GENTZ, REINER 
ROSEN, CRAIG 

(ii) TITLE OF INVENTION: CHITINASE 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX, P.L.L.C. 

(B) STREET: 1100 NEW YORK AVENUE, SUITE 600 

(C) CITY: WASHINGTON 

(D) STATE: D.C. 
{E) COUNTRY: US 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: TEA 

(B) FILING DATE: 09-AUG-96 
{C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: GOLDSTEIN, JORGE A. 
(B) REGISTRATION NUMBER: 2 9,021 

(C) REFERENCE /DOCKET NUMBER: 1488.046PC00 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID N0:1: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 



( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 82.. 1344 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GAATTCGGCA CGAGGGCTGT CGAAACCTCA GTGGATAAAA GACCTAGAGA ATGTGTATCC 60 

CAGAAGAAGC TGGCCAAGGA T ATG GGA GCA ACC ACC ATG GAC CAG AAG TCT 111 

Met Gly Ala Thr Thr Met Asp Gin Lys Ser 
15 10 

CTC TGG GCA GGT GTA GTG GTC TTG CTG CTT CTC CAG GGA GAG ATG GGG 15 9 

Leu Trp Ala Gly Val Val Val Leu Leu Leu Leu Gin Gly Glu Met Gly 
15 20 25 

TTT TGC TAT GTT GCC AGA GCT GGT CTT GAA CTC CTG GGC TCA AGA AGT 207 
Phe Cys Tyr Val Ala Arg Ala Gly Leu Glu Leu Leu Gly Ser Arg Ser 
30 35 40 

CCT CCT GCC TCA GCC TCC CAA AGT GCT GGG ATA ACA GGA TCT GCC TAG 2 55 

Pro Pro Ala Ser Ala Ser Gin Ser Ala Gly He Thr Gly Ser Ala Tyr 
45 50 55 

AAA CTG GTT TGC TAC TTT ACC AAC TGG TCC CAG GAC COG CAG GAA CCA 303 
Lys Leu Val Cys Tyr Phe Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro 
60 65 70 

GGA AAA TTC ACC CCT GAG AAT ATT GAC CCC TTC CTA TGC TCT CAT CTC 351 
Gly Lys Phe Thr Pro Glu Asn He Asp Pro Phe Leu Cys Ser His Leu 
75 80 85 90 

ATC TAT TCA TTC GCC AGC ATC GAA AAC AAC AAG GTT ATC ATC AAG GAC 3 99 

He Tyr Ser Phe Ala Ser He Glu Asn Asn Lys Val He He Lys Asp 
95 100 105 

AAG AGT GAA GTG ATG CTC TAC CAG ACC ATC AAC AGT CTC AAA ACC AAG 4 47 

Lys Ser Glu Val Met Leu Tyr Gin Thr He Asn Ser Leu Lys Thr Lys 
110 115 120 

AAT CCC AAA CTG AAA ATT CTC TTG TCC ATT GGA GGG TAC CTG TTT GGT 4 95 

Asn Pro Lys Leu Lys He Leu Leu Ser He Gly Gly Tyr Leu Phe Gly 
■ 125~ 130 135 

TCC AAA GGG TTC CAC CCT ATG GTG GAT TCT TCT ACA TCA CGC TTG GAA 54 3 
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Ser Lys Gly Phe His Pro Met Val Asp Ser Ser Thr Ser Arg Leu Glu 
140 145 150 

TTC ATT AAC TCC ATA ATC CTG TTT CTG AGG AAC CAT AAC TTT GAT GGA 5 91 

Phe lie Asn Ser He He Leu Phe Leu Arg Asn His Asn Phe Asp Gly 
155 160 165 170 



CTG GAT GTA AGC TGG ATC TAC CCA GAT CAG AAA GAA AAC ACT CAT TTC _ 63 9 

Leu Asp Val Ser Trp He Tyr Pro Asp Gin Lys Glu Asn Thr His Phe 
175 IBO 185 



ACT GTG CTG ATT CAT GAG TTA GCA GAA GCC TTT CAG AAG GAC TTC ACA 687 

Thr Val Leu lie His Glu Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr 
190 195 200 

AAA TCC ACC AAG GAA AGG CTT CTC TTG ACT GCG GGC GTA TCT GCA GGG 73 5 

Lys Ser Thr Lys Glu Arg Leu Leu Leu Thr Ala Gly Val Ser Ala Gly 
205 210 215 

AGG CAA ATG ATT GAT AAC AGC TAT CAA GTT GAG AAA CTG GCA AAA GAT 7 83 

Arg Gin Met He Asp Asn Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp 

220 225 230 

CTG GAT TTC ATC AAC CTC CTG TCC TTT GAC TTC CAT GGG TCT TGG GAA 8 31 

Leu Asp Phe He Asn Leu Leu Ser Phe Asp Phe His Gly Ser Trp Glu 
235 240 245 250 

AAG CCC CTT ATC ACT GGC CAC AAC AGC CCT CTG AGC AAG GGG TGG CAG 879 

Lys Pro Leu He Thr Gly His Asn Ser Pro Leu Ser Lys Gly Trp Gin 
255 260 265 

GAC AGA GGG CCA AGC TCC TAC TAC AAT GTG GAA TAT GCT GTG GGG TAC 92 7 

Asp Arg Gly Pro Ser Ser Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr 
270 275 280 

TGG ATA CAT AAG GGA ATG CCA TCA GAG AAG GTG GTC ATG GGC ATC CCC 9 75 

Trp He His Lys Gly Met Pro Ser Glu Lys Val Val Met Gly He Pro 
285 290 295 



ACA TAT GGG CAC TCC TTC ACA CTG GCC TCT GCA GAA ACC ACC GTG GGG 1023 
Thr Tyr Gly His Ser Phe Thr Leu Ala Ser Ala Glu Thr Thr Val Gly 
300 305 310 



GCC CCT GCC TCT GGC CCT GGA GCT GCT GGA CCC ATC ACA GAG TCT TCA 10 71 

Ala Pro Ala Ser Gly Pro Gly Ala Ala Gly Pro He Thr Glu Ser Ser 
315 320 325 330 

GGC TTC CTG GCC TAT TAT GAG ATC TGC CAG TTC CTG AAA GGA GCC AAG 1119 
Gly Phe Leu Ala Tyr Tyr Glu He Cys Gin Phe Leu Lys Gly Ala Lys 
335 340 345 



ATC ACG.-AGG CTC CAG GAT CAG CAG GTT CCC TAC GCA GTC AAG GGG AAC 
He Thr Arg Leu Gin Asp Gin Gin Val Pro Tyr Ala Val Lys Gly Asn 
350 355 360 



1167 
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CAG TGG GTG GGC TAT GAT GAT GTG AAG AGT ATG GAG ACC AAG GTT CAG 1215 
Gin Trp Val Gly Tyr Asp Asp Val Lys Ser Met Glu Thr Lys Val Gin 
365 370 375 

TTC TTA AAG AAT TTA AAC CTG GGA GGA GCC ATG ATC TGG TCT ATT GAC 12 6 3 

Phe Leu Lys Asn Leu Asn Leu Gly Gly Ala Met lie Trp Ser lie Asp 
380 385 390 

ATG GAT GAC TTC ACT GGC AAA TCC TGC AAC CAG GGC CCT TAG CCT CTT ' 1311 
Met Asp Asp Phe Thr Gly Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu 
395 400 405 410 

GTC CAA GCA GTC AAG AGA AGC CTT GGC TCC CTG TGAAGGATTA ACTTACAGAG 1364 
Val Gin Ala Val Lys Arg Ser Leu Gly Ser Leu 
415 420 

AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT CTCATGTGGG 1424 

ATTCCCCTTG CCAGGCCGGC CTTTGGATCT CTCTTCCAAG CCTTTCCTGA CTTCCTCTTA 14 84 

GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGAT GACTTGTTGC CCTGAAGTAC 1544 

AATAAAAAAA ATTCATTTTG CTCCAGTAAA AAAAAAAAAA AAAAACTCGA G 15 95 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 amino acids 

(B) TYPE; amino acid 
{D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Ala Thr Thr Met Asp Gin Lys Ser Leu Trp Ala Gly Val Val 
1 5 10 15 - ■ 

Val Leu Leu Leu Leu Gin Gly Glu Met Gly Phe Cys Tyr Val Ala Arg 
20 25 30 

Ala Gly Leu Glu Leu Leu Gly Ser Arg Ser Pro Pro Ala Ser Ala Ser 
35 40 45 

Gin Ser Ala Gly He Thr Gly Ser Ala Tyr Lys Leu Val Cys Tyr Phe 
50 55 60 

Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro Gly Lys Phe Thr Pro Glu 
65 . 70 75 80 

Asn He Asp Pro Phe Leu Cys Ser His Leu He Tyr Ser Phe Ala Ser 
85 90 95 

He Glu Asn Asn Lys Val lie He Lys Asp Lys Ser Glu Val Met Leu 
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100 



105 



110 



Tyr Gin Thr He Asn Ser Leu Lys Thr Lys Asn Pro Lys Leu Lys He 
lis 120 125 

Leu Leu Ser He Gly Gly Tyr Leu Phe Gly Ser Lys Gly Phe His Pro 
130 135 140 

Met Val Asp Ser Ser Thr Ser Arg Leu Glu Phe He Asn Ser He He 

150 155 160 

Leu Phe Leu Arg Asn His Asn Phe Asp Gly Leu Asp Val Ser Trp He 
165 170 175 

Tyr Pro Asp Gin Lys Glu Asn Thr His Phe Thr Val Leu He His Glu 
180 185 190 

Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr Lys Ser Thr Lys Glu Arg 
195 200 205 

Leu Leu Leu Thr Ala Gly Val Ser Ala Gly Arg Gin Met He Asp Asn 
210 215 220 

Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp Leu Asp Phe He Asn Leu 
225 230 235 240 

Leu Ser Phe Asp Phe His Gly Ser Trp Glu Lys Pro Leu He Thr Gly 
245 250 255 

His Asn Ser Pro Leu Ser Lys Gly Trp Gin Asp Arg Gly Pro Ser Ser 
260 265 27C 

Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr Trp He His Lys Gly Met 
275 280 285 

Pro Ser Glu Lys Val Val Met Gly He Pro Thr Tyr Gly His Ser Phe 
290 295 300 

Thr Leu Ala Ser Ala Glu Thr Thr Val Gly Ala Pro Ala Ser Gly Pro 
305 310 315 320 

Gly Ala Ala Gly Pro He Thr Glu Ser Ser Gly Phe Leu Ala Tyr Tyr 
325 320 335 

Glu He Cys Gin Phe Leu Lys Gly Ala Lys He Thr Arg Leu Gin Asp 
340 345 350 

Gin Gin Val Pro Tyr Ala Val Lys Gly Asn Gin Trp Val Gly Tyr Asp 
355 360 365 



Asp Val Lys Ser Met Glu Thr Lys Val Gin Phe Leu Lys Asn Leu Asn 
370 - ^ 375 380 



Leu Gly Gly Ala Met He Trp Ser He Asp Met Asp Asp Phe Thr Gly 
385 390 395 400 
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Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu Val Gin Ala Val Lys Arg 
405 410 415 



Ser Leu Gly Ser Leu 
420 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 82 . . 1344 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
■ GAATTCGGCA CGAGGGCTGT CGAAACCTCA GTGGATAAAA GACCTAGAGA ATGTGTATCC 6 0 

CAGAAGAAGC TGGCCAAGGA T ATG GGA GCA ACC ACC ATG GAC CAG AAG TCT 111 



Met G-ly Ala Thr Thr Met Asp Gin Lys Ser 
15 10 



CTC TGG GCA GGT GTA GTG GTC TTG CTG CTT CTC CAG GGA GAG ATG GGG 
Leu Trp Ala Gly Val Val Val Leu Leu Leu Leu Gin Gly Glu Met Gly 
15 20 25 



159 



TTT TGC TAT GTT GCC AGA GCT GGT CTT GAA CTC CTG GGC TCA AGA AGT 
Phe Cys Tyr Val Ala Arg Ala Gly Leu Glu Leu Leu Gly Ser Arg Ser 
30 35 40 



207 



CCT CCT GCC TCA GCC TCC CAA AGT GCT GGG ATA ACA GGA TCT GCC TAC 
Pro Pro Ala Ser Ala Ser Gin Ser Ala Gly He Thr Gly Ser Ala Tyr 
45 50 55 



255 



AAA CTG GTT TGC TAC TTT ACC AAC TGG TCC CAG GAC CGG CAG GAA CCA 
Lys Leu Val Cys Tyr Phe Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro 
60 65 70 



303 



GGA AAA TTC ACC CCT GAG AAT ATT GAC CCC TTC CTA TGC TCT CAT CTC 
Gly Lys Phe Thr Pro Glu Asn He Asp Pro Phe Leu Cys Ser His Leu 
75 80 85 90 



351 



ATC TAT TCA TTC GCC AGC ATC GAA AAC AAC AAG GTT ATC ATC AAG GAC 
He Tyr^ Ser Phe Ala Ser He Glu Asn Asn Lys Val He He Lys Asp 
95 100 105 



399 



AAG AGT GAA GTG ATG CTC TAC CAG ACC ATC AAC AGT CTC AAA ACC AAG 



447 
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Lys Ser Glu Val Met Leu Tyr Gin Thr lie Asn Ser Leu Lys Thr Lys 
110 115 120 

AAT CCC AAA CTG AAA ATT CTC TTG TCC ATT GGA GGG TAC CTG TTT GGT 4 95 

Asn Pro Lys Leu Lys lie Leu Leu Ser lie Gly Gly Tyr Leu Phe Gly 

125 130 135 

TCC AAA GGG TTC CAC CCT ATG GTG GAT TCT TCT ACA TCA CGC TTG GAA 54 3 

Ser Lys Gly Phe His Pro Met Val Asp Ser Ser Thr Ser Arg Leu Glu 
140 145 150 

TTC ATT AAC TCC ATA ATC CTG TTT CTG AGG AAC CAT AAC TTT GAT GGA 5 91 

Phe lie Asn Ser lie lie Leu Phe Leu Arg Asn His Asn Phe Asp Gly 
155 160 165 170 

CTG GAT GTA AGC TGG GAA TAC CCA GAT CAG AAA GAA AAC ACT CAT TTC 63 9 

Leu Asp Val Ser Trp Glu Tyr Pro Asp Gin Lys Glu Asn Thr His Phe 
175 180 185 

ACT GTG CTG ATT CAT GAG TTA GCA GAA GCC TTT CAG AAG GAC TTC ACA 687 

Thr Val Leu lie His Glu Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr 
190 195 200 

AAA TCC ACC AAG GAA AGG CTT CTC TTG ACT GCG GGC GTA TCT GCA GGG 73 5 

Lys Ser Thr Lys Glu Arg Leu Leu Leu Thr Ala Gly Val Ser Ala Gly 

205 210 215 

AGG CAA ATG ATT GAT AAC AGC TAT CAA GTT GAG AAA CTG GCA AAA GAT 7B3 

Arg Gin Met lie Asp Asn Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp 
220 225 230 

CTG GAT TTC ATC AAC CTC CTG TCC TTT GAC TTC CAT GGG TCT TGG GAA 831 

Leu Asp Phe lie Asn Leu Leu Ser Phe Asp Phe His Gly Ser Trp Glu 
235 240 245 250 

AAG CCC CTT ATC ACT GGC CAC AAC AGC CCT CTG AGC AAG GGG TGG CAG 87 9 

Lys Pro Leu lie Thr Gly His Asn Ser Pro Leu Ser Lys Gly Trp Gin. 

255 . 260 265 ^ ^ 

GAC AGA GGG CCA AGC TCC TAC TAC AAT GTG GAA TAT GCT GTG GGG TAC 92 7 

Asp Arg Gly Pro Ser Ser Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr 
270 275 280 

TGG ATA CAT AAG GGA ATG CCA TCA GAG AAG GTG GTC ATG GGC ATC CCC 97 5 

Trp lie His Lys Gly Met Pro Ser Glu Lys Val Val Met Gly He Pro 

285 290 295 

ACA TAT GGG CAC TCC TTC ACA CTG GCC TCT GCA GAA ACC ACC GTG GGG 102 3 

Thr Tyr Gly His Ser Phe Thr Leu Ala Ser Ala Glu Thr Thr Val Gly 
300 305 310 

GCC CCT GCC TCT GGC CCT GGA GCT GCT GGA CCC ATC ACA GAG TCT TCA 10 71 

Ala Pro Ala Ser Gly Pro Gly Ala Ala Gly Pro He Thr Glu Ser Ser 
315 320 325 330 
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GGC TTC CTG GCC TAT TAT GAG ATC TGC CAG TTC CTG AAA GGA GCC AAG 1119 
Gly Phe Leu Ala Tyr Tyr Glu lie Cys Gin Phe Leu Lys Gly Ala Lys 
335 340 345 

ATC ACG AGG CTC CAG GAT CAG CAG GTT CCC TAC GCA GTC AAG GGG AAC 116 7 

He Thr Arg Leu Gin Asp Gin Gin Val Pro Tyr Ala Val Lys Gly Asn 
350 355 360 

CAG TGG GTG GGC TAT GAT GAT GTG AAG AGT ATG GAG ACC AAG GTT CAG ' 1215 
Gin Trp Val Gly Tyr Asp Asp Val Lys Ser Met Glu Thr Lys Val Gin 
365 370 375 

TTC TTA AAG AAT TTA AAC CTG GGA GGA GCC ATG ATC TGG TCT ATT GAC 126 3 

Phe Leu Lys Asn Leu Asn Leu Gly Gly Ala Met He Trp Ser He Asp 
380 385 390 

ATG GAT GAC TTC ACT GGC AAA TCC TGC AAC CAG GGC CCT TAC CCT CTT 1311 
Met Asp Asp Phe Thr Gly Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu 
335 400 405 410 

GTC CAA GCA GTC AAG AGA AGC CTT GGC TCC CTG TGAAGGATTA ACTTACAGAG 1364 
Val Gin Ala Val Lys Arg Ser Leu Gly Ser Leu 
415 420 

AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT CTCATGTGGG 14 24 

ATTCCCCTTG CCAGGCCGGC CTTTGGATCT CTCTTCCAAG CCTTTCCTGA CTTCCTCTTA 14 84 

GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGAT GACTTGTTGC CCTGAAGTAC 1544 

AATAAAAAAA ATTCATTTTG CTCCAGTAAA AAAAAAAAAA AAAAACTCGA G 15 95 



(2} INFORMATION FOR SEQ ID N0:4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 21 amino acids 
CB) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Gly Ala Thr 
1 

Val Leu Leu Leu 
20 

Ala Gly Leu Glu 
35 

Gin Ser Ala Gly 
50 



Thr Met Asp Gin 
5 

Leu Gin Gly Glu 

Leu Leu Gly Ser 
40 

He Thr Gly Ser 
55 



Lys Ser Leu Trp 
10 

Met Gly Phe Cys 
25 

Arg Ser Pro Pro 



Ala Tyr Lys Leu 
60 



Ala Gly Val Val 
15 

Tyr Val Ala Arg 
30 

Ala Ser Ala Ser 
45 

Val Cys Tyr Phe 
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Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro Gly Lys Phe Thr Pro Glu 
65 70 75 80 

Asn lie Asp Pro Phe Leu Cys Ser His Leu He Tyr Ser Phe Ala Ser 
85 90 95 

He Glu Asn Asn Lys Val He He Lys Asp Lys Ser Glu VaL Met Leu 
100 105 110 

Tyr Gin Thr He Asn Ser Leu Lys Thr Lys Asn Pro Lys Leu Lys He 
115 120 125 

Leu Leu Ser He Gly Gly Tyr Leu Phe Gly Ser Lys Gly Phe His Pro 
130 135 140 

Met Val Asp Ser Ser Thr Ser Arg Leu Glu Phe He Asn Ser He He 
145 150 155 160 

Leu Phe Leu Arg Asn His Asn Phe Asp Gly Leu Asp Val Ser Trp Glu 
165 170 175 

Tyr Pro Asp Gin Lys Glu Asn Thr His Phe Thr Val Leu He His Glu 
180 185 190 

Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr Lys Ser Thr Lys Glu Arg 
195 200 205 

Leu Leu Leu Thr Ala Gly Val Sef Ala Gly Arg Gin Met He Asp Asn 
210 215 220 

Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp Leu Asp Phe He Asn Leu 
225 230 235 240 

Leu Ser Phe Asp Phe His Gly Ser Trp Glu Lys Pro Leu He Thr Gly 
245 250 255 

His Asn Ser Pro Leu Ser Lys Gly Trp Gin Asp Arg Gly Pro Ser Ser 
260 265 270 

Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr Trp He His Lys Gly Met 
275 280 285 

Pro Ser Glu Lys Val Val Met Gly He Pro Thr Tyr Gly His Ser Phe 
290 295 300 

Thr Leu Ala Ser Ala Glu Thr Thr Val Gly Ala Pro Ala Ser Gly Pro 
305 310 315 320 

Gly Ala Ala Gly Pro He Thr Glu Ser Ser Gly Phe Leu Ala Tyr Tyr 
325 330 335 

Glu He Cys Gin Phe Leu Lys Gly Ala Lys He Thr Arg Leu Gin Asp 
340 345 350 

Gin Gin Val Pro Tyr Ala Val Lys Gly Asn Gin Trp Val Gly Tyr Asp 
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355 360 365 

Asp Val Lys Ser Met Glu Thr Lys Val Gin Phe Leu Lys Asn Leu Asn 
370 375 380 

Leu Gly Gly Ala Met He Trp Ser He Asp Met Asp Asp Phe Thr Gly 
385 390 395 400 

Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu Val Gin Ala Val Lys Arg 
405 410 415 

Ser Leu Gly Ser Leu 
420 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 
fA) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGCGGATCCA TGGGAGCAAC CACCATGGAC CAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCGGATCCG CTTCTCTGTA AGTTAATCCT 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGCGGATCCG CCATCATGGG AGCAACCACC ATG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0;8: 
CGCGGATCCG CTTCTCTGTA AGTTAATCCT 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
CGCGGATCCG CCATCATGGG AGCAACCACC ATG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCGGATCCT CAAGCGTAGT CTGGGACGTC GTATGGGTAC AGGGAGCCAA GGCTTCT 



57 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



( ix ) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 27.. 1181 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGAATTCGGC ACGAGGAGCA ACCACC ATG GAC CAG AAG TCT CTC TGG GCA GGT 53 

Met Asp Gin Lys Ser Leu Trp Ala Gly 

1 5 

GTA GTG GTC TTG CTG CTT CTC CAG GGA GGA TCT GCC TAC AAA CTG GTT 101 
Val Val Val Leu Leu Leu Leu Gin Gly Gly Ser Ala Tyr Lys Leu Val 
10 15 20 25 

TGC TAC TTT ACC AAC TGG TCC CAG GAC CGG CAG GAA CCA GGA AAA TTC 14 9 

Cys Tyr Phe Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro Gly Lys Phe 
30 35 40 

ACC CCT GAG AAT ATT GAC CCC TTC CTA TGC TCT CAT CTC ATC TAT TCA 197 
Thr Pro Glu Asn He Asp Pro Phe Leu Cys Ser His Leu He Tyr Ser 
45 50 55 

TTC GCC AGC ATC GAA AAC AAC AAG GTT ATC ATC AAG GAC AAG AGT GAA 245 
Phe Ala Ser He Glu Asn Asn Lys Val He He Lys Asp Lys Ser Glu 
60 65 70 

GTG ATG CTC TAC CAG ACC ATC AAC AGT CTC AAA ACC AAG AAT CCC AAA 2 93 

Val Met Leu Tyr Gin Thr He Asn Ser Leu Lys Thr Lys Asn Pro Lys 
75 80 85 

CTG AAA ATT CTC TTG TCC ATT GGA GGG TAC CTG TTT GGT TCC AAA GGG 341 
Leu Lys He Leu Leu Ser He Gly Gly Tyr Leu Phe Gly Ser Lys Gly 
90 95 100 105 

TTC CAC CCT "'ATG GTG GAT TCT TCT ACA TCA CGC TTG GAA TTC ATT AAC 38 9 

Phe His Pro Met Val Asp Ser Ser Thr Ser Arg Leu Glu Phe He Asn 
110 115 120 
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TCC ATA ATC CTG TTT CTG AGG AAC CAT AAC TTT GAT GGA CTG GAT GTA 43 7 

Ser lie lie Leu Phe Leu Arg Asn His Asn Phe Asp Gly Leu Asp Val 
125 130 135 

AGC TGG.ATC TAG CCA GAT GAG AAA GAA AAC ACT CAT TTC ACT GTG CTG 48 5 

Ser Trp lie Tyr Pro Asp Gin Lys Glu Asn Thr His Phe Thr Val Leu 
140 145 150 

ATT CAT GAG TTA GCA GAA GCC TTT CAG AAG GAC TTC ACA AAA TCC ACC '53 3 

lie His Glu Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr Lys Ser Thr 
155 160 165 

AAG GAA AGG CTT CTC TTG ACT GCG GGC GTA TCT GCA GGG AGG CAA ATG 581 
Lys Glu Arg Leu Leu Leu Thr Ala Gly Val Ser Ala Gly Arg Gin Met 
170 175 180 185 

ATT GAT AAC AGC TAT CAA GTT GAG AAA CTG GCA AAA GAT CTG GAT TTC 629 
lie Asp Asn Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp Leu Asp Phe 
190 195 200 

ATC AAC CTC CTG TCC TTT GAC TTC CAT GGG TCT TGG GAA AAG CCC CTT 6 77 

lie Asn Leu Leu Ser Phe Asp Phe His Gly Ser Trp Glu Lys Pro Leu 
205 210 215 

ATC ACT GGC CAC AAC AGC CCT CTG AGC AAG GGG TGG CAG GAC AGA GGG 72 5 

lie Thr Gly His Asn Ser Pro Leu Ser Lys Gly Trp Gin Asp Arg Gly 
220 225 230 

CCA AGC TCC TAC TAG AAT GTG GAA TAT GCT GTG GGG TAC TGG ATA CAT 773 
Pro Ser Ser Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr Trp lie His 
235 240 245 

AAG GGA ATG CCA TCA GAG AAG GTG GTC ATG GGC ATC CCC ACA TAT GGG 821 
Lys Gly Met Pro Ser Glu Lys Val Val Met Gly lie Pro Thr Tyr Gly 
250 255 260 265 

CAC TCC TTC ACA CTG GCC TCT GCA GAA ACC ACC GTG GGG GCC CCT GCC 869 
His Ser Phe Thr Leu Ala Ser Ala Glu Thr Thr Val Gly Ala Pro Ala ' 
270 275 280 

TCT GGC CCT GGA GCT GCT GGA CCC ATC ACA GAG TCT TCA GGC TTC CTG 917 
Ser Gly Pro Gly Ala Ala Gly Pro lie Thr Glu Ser Ser Gly Phe Leu 
285 290 295 

GCC TAT TAT GAG ATC TGC CAG TTC CTG AAA GGA GCC AAG ATC ACG AGG 96 5 

Ala Tyr Tyr Glu He Cys Gin Phe Leu Lys Gly Ala Lys He Thr Arg 
300 305 310 

CTC CAG GAT CAG CAG GTT CCC TAC GCA GTC AAG GGG AAC CAG TGG GTG 1013 
Leu Gin Asp Gin Gin Val Pro Tyr Ala Val Lys Gly Asn Gin Trp Val 
315 320 325 

GGC TAT GAT GAT GTG AAG AGT ATG GAG ACC AAG GTT CAG TTC TTA AAG 1061 
Gly Tyr Asp Asp Val Lys Ser Met Glu Thr Lys Val Gin Phe Leu Lys 
330 335 340 345 



-69. 

AAT TTA AAC CTG GGA GGA GCC ATG ATC TGG TCT ATT GAC ATG GAT GAC 1109 

Asn Leu Asn Leu Gly Gly Ala Met He Trp Ser He Asp Met Asp Asp 
350 355 360 

TTC ACT GGC AAA TCC TGC AAC CAG GGC CCT TAG CCT CTT GTC CAA GCA 1157 
Phe Thr Gly Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu Val Gin Ala 
365 370 375 

GTC AAG AGA AGC CTT GGC TCC CTG TGAAGGATTA ACTTACAGAG AAGCAGGCAA ' 1211 
Val Lys Arg Ser Leu Gly Ser Leu 
380 385 

GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT CTCATGTGGG ATTCCCCTTG 1271 

CCAGGCCGGC CTTTGGATCT CTCTTCCAAG CCTTTCCTGA CTTCCTCTTA GATCATAGAT 13 31 

TGGACCTGGT TTTGTTTTCC TGCAGCTGAT GACTTGTTGC CCTGAAGTAC AATAAAAAAA 13 91 

ATTCATTTTG CTCCAAAAAA AAAAAAAAAA AAAAACTCGA G 1432 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asp Gin Lys Ser Leu Trp Ala Gly Val Val Val Leu Leu Leu Leu 
15 10 15 

Gin Gly Gly Ser Ala Tyr Lys Leu Val Cys Tyr Phe Thr Asn Trp Ser 
20 25 30 

Gin Asp Arg Gin Glu Pro Gly Lys Phe Thr Pro Glu Asn He Asp Pro 
35 40 45 

Phe Leu Cys Ser His Leu He Tyr Ser Phe Ala Ser He Glu Asn Asn 
50 55 60 

Lys Val He He Lys Asp Lys Ser Glu Val Met Leu Tyr Gin Thr He 
^5 70 75 80 

Asn Ser Leu Lys Thr Lys Asn Pro Lys Leu Lys He Leu Leu Ser He 
85 90 95 

Gly Gly Tyr Leu Phe Gly Ser Lys Gly Phe His Pro Met Val Asp Ser 
100 105 110 

Ser Thr Ser Arg Leu Glu Phe He Asn Ser He He Leu Phe Leu Arg 
H5 120 125 
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Asn His Asn Phe Asp 
130 

Lys Glu Asn Thr His 
145 

Phe Gin Lys Asp Phe 
165 

Ala Gly Val Ser Ala 
180 

Glu Lys Leu Ala Lys 
195 

Phe His Gly Ser Trp 
210 

Leu Ser Lys Gly Trp 
225 

Glu Tyr Ala Val Gly 
245 

Val Val Met Gly He 
260 

Ala Glu Thr Thr Val 
275 

Pro He Thr Glu Ser 
290 

Phe Leu Lys Gly Ala 
305 

Tyr Ala Val Lys Gly 
325 

Met Glu Thr Lys Val 
340 

Met He Trp Ser He 
355 

Gin Gly Pro Tyr Pro 
370 
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Gly Leu Asp Val Ser Trp 
135 

Phe Thr Val Leu He His 
150 155 

Thr Lys Ser Thr Lys Glu 
170 

Gly Arg Gin Met He Asp 
185 

Asp Leu Asp Phe He Asn 
200 

Glu Lys Pro Leu He Thr 
215 

Gin Asp Arg Gly Pro Ser 
230 235 

Tyr Trp He His Lys Gly 
250 

Pro Thr Tyr Gly His Ser 
265 

Gly Ala Pro* Ala Ser Gly 
280 

Ser Gly Phe Leu Ala Tyr 
295 

Lys He Thr Arg Leu Gin 
310 315 

Asn Gin Trp Val Gly Tyr 
330 

Gin Phe Leu Lys Asn Leu 
345 

Asp Met Asp Asp Phe Thr 
360 

Leu val Gin Ala Val Lys 
375 



He Tyr Pro Asp Gin 
140 

Glu Leu Ala Glu Ala 
160 

Arg Leu Leu Leu Thr 
175 

Asn Ser Tyr Gin Val 
190 

Leu Leu Ser Phe Asp 

205 

Gly His Asn Ser Pro 
220 

Ser Tyr Tyr Asn Val 
240 

Met Pro Ser Glu Lys 
255 

Phe Thr Leu Ala Ser 
270 

Pro Gly Ala Ala Gly 
285 

Tyr Glu lie Cys Gin 
300 

Asp Gin Gin Val Pro 
320 

Asp Asp Val Lys Ser 
335 

Asn Leu Gly Gly Ala 
350 

Gly Lys Ser Cys Asn 
365 

Arg Ser Leu Gly Ser 
380 



Leu 
385 
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Applicant's or ^g«"^'*/*|?jo 
reference number 14o8,04v J 



;00 



International appl tcaiion No 
^ 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule nbis) 



A. The indications made below relate to the microorganism referred to in the description 

5 16 
on page » . 



B, IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 

Anerican Type Culture Collection 



Address of depositary institution (iftctuding posiai codeaiui coumry) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United State of America 



Date of deposit 

May 2, 1996 



Accession Number 

ATCC Designation 97529 



C. ADDITIONAL INDICATIONS (leave biank if not applicable} This information is continued on an additional sheet | { 



DNA plasmid 1290853 {Docket PF286) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications ore not for aU designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable} 



The indications listed below will be submitted to the Imernational Bureau later (spccifythe general nature of the indications e.g., 'Accession 
Number of Deposit') 




For receiving Office use only 



sheet was received with the mternationai application 



A/thprizcd officer 




For Iniernaiional Bureau use only 



[ I This sheet was received by the Internationa! Bureau on: 



Authorized officfr 
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fVhat Is Claimed Is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide 
having a nucleotide sequence at least 95% identical to a sequence selected from 
the group consisting of: 

(a) a nucleotide sequence encoding the chitinase alpha 
polypeptide having the complete amino acid sequence in Figure 1 [SEQ ID N0:2] 
or Figure 2 [SEQ ID N0:4]; 

(b) a nucleotide sequence encoding the mature chitinase alpha 
polypeptide having the amino acid sequence at positions 27 - 421 in Figure 1 
[SEQ ID N0:2] or Figure 2 [SEQ ID N0:4]; 

(c) a nucleotide sequence encoding the chitinase alpha 
polypeptide having the complete amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 97529; 

(d) a nucleotide sequence encoding the mature chitinase alpha 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 97529; 

(e) a nucleotide sequence encoding the chitinase aipha-2 
polypeptide having the complete amino acid sequence shown in Figure 5 [SEQ ID 
NO: 12]; 

(f) a nucleotide sequence encoding the mature chitinase alpha- 
2 polypeptide having the amino acid sequence at positions 21-385 in Figure 5 
[SEQ ID NO: 12]; and 

(g) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), or (f). 

2. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the complete nucleotide sequence in Figure 1 [SEQ ID NO: 1 ], Figure 2 [SEQ 
ID N0:3], or Figure 5 [SEQ ID NO: 1 1]. 
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3. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence in Figure 1 [SEQ ID NO:l], Figure 2 [SEQ ID 
N0:3], or Figure 5 [SEQ ID NO: 11] encoding the polypeptide having the 
complete amino acid sequence in Figure 1 [SEQ ID NO:2], Figure 2 [SEQ ID 
N0:41, or Figure 5 [SEQ ED NO: 12], respectively. 

4. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence in Figure 1 [SEQ ID N0:1], Figure 2 [SEQ ID 
N0:3], or Figure 5 [SEQ ID N0:11] encoding the mature chitinase alpha 
polypeptide having the amino acid sequence at positions 27 - 421 in Figure 1 
[SEQ ID N0:2] or Figure 2 [SEQ ID N0:4] or encoding the mature chitinase 
alpha-2 polypeptide having the amino acid sequence at positions 21-385 in Figure 
5 [SEQ ID NO: 12] 

5. The nucleic acid molecule of claim I wherein said polynucleotide 
has the complete nucleotide sequence of the cDNA clone contained in ATCC 
Deposit No. 97529. 

6. The nucleic acid molecule of claim I wherein said polynucleotide 
has the nucleotide sequence encoding the chitinase alpha polypeptide having the 
complete amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97529. 

7. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence encoding the mature chitinase alpha polypeptide 
havmg the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97529. 

8. An isolated nucleic acid molecule comprising a polynucleotide 
which hybridizes under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence identical to a nucleotide sequence in (a), (b), (c), (d), 
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(e), (0, or (g) of claim 1 wherein said polynucleotide which hybrizes does not 
hybridize under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence consisting of only A residues or of only T residues. 

9. An isolated nucleic acid molecule comprising a polynucleotide 
5 which encodes the amino acid sequence of an epitope-bearing portion of a 

chitinase alpha or chhinase alpha-2 polypeptide having an amino acid sequence in 
(a), (b), (c), (d), (e), or(f) of claim I. 

10. The isolated nucleic acid molecule of clahn 9, which encodes an 
epitope-bearing portion of a chitinase alpha or chitinase alpha-2 polypeptide 

10 selected from the group consisting of a polypeptide comprising amino acid 

residues from about 38 to about 51 in Figure 1 (SEQ ED N0:2); a polypeptide 
comprising amino acid residues from about 66 to about 85 in Figure 1 (SEQ ED 
NO:2); a polypeptide comprising amino acid residues from about 117 to about 
127 in Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues 

15 from about 145 to about 153 in Figure 1 (SEQ ID N0:2); a polypeptide 

comprising amino acid residues from about 165 to about 172 in Figure 1 (SEQ ID 
N0:2); a polypeptide comprising amino acid residues from about 177 to about 
185 in Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues 
from about 245 to about 254 in Figure 1 (SEQ ID N0:2); a polypeptide 

20 comprising amino acid residues from about 256 to about 274 in Figure 1 (SEQ ID 

N0;2); a polypeptide comprising amino acid residues from about 286 to about 
293 in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues 
from about 366 to about 376 in Figure 1 (SEQ ID N0;2); and a polypeptide 
comprising amino acid residues from about 394 to about 407 in Figure 1 (SEQ ID 

25 N0:2). 



11. A method for making a recombinant vector comprising inserting 
an isolated nucleic acid molecule of claim 1 into a vector. 
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12. A recombinant vector produced by the method of claim 1 1 . 

13. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 12 into a host cell. 

14. A recombinant host cell produced by the method of claim 13, 

15. A recombinant method for producing a chitinase alpha or chitinase 
alpha-2 polypeptide, comprising culturing the recombinant host cell of claim 14 
under conditions such that said polypeptide is expressed and recovering said 
polypeptide. 

16. An isolated chitinase alpha or chitinase alpha-2 polypeptide having 
an amino acid sequence at least 95% identical to a sequence selected from the 
group consisting of: 

(a) the amino acid sequence of the chitinase alpha polypeptide 
having the complete amino acid sequence in Figure 1 [SEQ ID NO:2] or Figure 
2 [SEQ ID N0:41; 

(b) the amino acid sequence of the mature chitinase alpha 
polypeptide having the amino acid sequence at positions 27 - 421 in Figure 1 
[SEQ ID N0:2] or Figure 2 [SEQ ID N0:4]; 

(c) the amino acid sequence of the chitinase alpha polypeptide 
having the complete amino acid sequence encoded by the cDNA clone contained 
in ATCC Deposit No. 97529; and 

(d) the amino acid sequence of the mature chitinase alpha 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 97529; 

(e) the amino acid sequence of the chitinase alpha-2 
polypeptide having the complete 385 amino acid sequence including the leader 
sequence shown in Figure 5 [SEQ ID NO; 12]; 
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(0 the amino acid sequence of the mature chitinase alpha-2 
polypeptide (without the leader) having the amino acid sequence at positions 21- 
385 in Figure 5 [SEQ ID NO: 12], and 

(e) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), (e), or (f). 

1 7. An isolated polypeptide comprising an epitope-bearing portion of 
chitinase alpha, wherein said portion is selected from the group consisting of: a 
polypeptide comprising amino acid residues from about 38 to about 51 in Figure 
1 (SEQ ID N0:2); a polypeptide comprising amino acid residues from about 66 
to about 85 in Figure 1 (SEQ ID NO: 2); a polypeptide comprising amino acid 
residues from about 117 to about 127 in Figure 1 (SEQ ID N0:2); a polypeptide 
comprising amino acid residues from about 145 to about 1 53 in Figure 1 (SEQ ID 
NO: 2); a polypeptide comprising amino acid residues from about 165 to about 
172 in Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues 
from about 177 to about 185 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino acid residues from about 245 to about 254 in Figure 1 (SEQ ID 
N0:2); a polypeptide comprising amino acid residues from about 256 to about 
274 in Figure 1 (SEQ ID N0:2); a polypeptide comprising amino acid residues 
from about 286 to about 293 in Figure 1 (SEQ ID N0:2); a polypeptide 
comprising amino acid residues from about 366 to about 376 in Figure 1 (SEQ ID 
NO: 2); and a polypeptide comprising amino acid residues from about 394 to about 
407 in Figure 1 (SEQ ID N0:2). 

1 8. An isolated antibody that binds specifically to a chitinase alpha or 
chitinase alpha-2 polypeptide of claim 16. 

19. A method for treating a patient having need of anti-fimgal therapy 
comprising administering to said patient a therapeutically effective amount of the 
polypeptide of claim 16. 
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20. A method for treating a patient having a tissue remodeling 
disorder, comprising administering to said patient a therapeutically effective 
amount of an antagonist of the polypeptide of claim 16. 



1/12 

10 30 50 

gaattcggcacgagggctgtcgaaacctcagtggataaaagacctagagaatgtgtatcc 

70 90 110 

cagaagaagctggccaaggatATGGGAGCAACCACCATGGACCAGAAGTCTCTCTGGGCA 

MGATTMDQKSLWA 

130 150 170 

GGTGTAGTGGTCnGCTGCTTCTCCAGGGAGAGATGGGGTTTTGCTATGnGCCAGAGCT 
G V V V L L L L Q G E M G F C Y V A R A 

190 210 230 

GGTCTTGAACTCCTGGGCTCAAGAAGTCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATA 
GLELL6SRSPPASASQSAGI 

250 270 290 

ACAGGATCTGCCTACAAACTGGTTTGCTACTTTACCAACTGGTCCCAGGACCGGCAGGAA 
T6SAYKLVCYFTNWSQDRQE 

310 330 350 

CCAGGAAAAnCACCCCTGAGAATATTGACCCCnCCTATGCTCTCATCTCATCTATTCA 
PGKFTPENIDPFLCSHLIYS 

370 390 410 

nCGCCAGCATCGAAAACAACAAGGTTATCATCAAGGACAAGAGTGAAGTGATGCTCTAC 
FASIENNKVIIKDKSEVMLY 

430 450 470 

CAGACCATCAACAGTCTCAAAACCAAGAATCCCAAACTGAAAAnCTCnGTCCATTGGA 
QTINSLKTKNPKLKILLSIG 

490 510 530 

GGGTACCTGnTGGTTCCAAAGGGnCCACCCTATGGTGGATTCnCTACATCACGCnG 
GYLFGSKGFHPMVDSSTSRL 

550 570 590 

GAAnCAnAACTCCATAATCCTGTTTCTGAGGAACCATAACTnGATGGACTGGATGTA 
EFINSIILFLRNHNFDGLDV 

610 630 650 

AGCTGGATCTACCCAGATCAGAAAGAAAACACTCATTTCACTGTGCTGATTCATGAGnA 
SWIYPDQKENTHFTVLIHEL 

670 690 710 

GCAGAAGCCTTTCAGAAGGACnCACAAAATCCACCAAGGAAAGGCTTCTCTTGACTGCG 
AEAFQKDFTKSTKERLLLTA 

730 750 770 

GGCGTATCTGCAGGGAGGCAAATGAnGATAACAGCTATCAAGTTGAGAAACTGGCAAAA 
GV5AGRQMIDNSYQVEKLAK 

790 810 830 

GATCTGGATTTCATCAACCTCCTGTCCTTTGACTTCCATGGGTCnGGGAAAAGCCCCTT 
DLDFINLLSFDFHGSWEKPL 



FIG.1A 
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850 870 890 

ATCACTGGCCACAACAGCCCTCTGAGCAAGGGGTGGCAGGACA6AGG6CCAAGCTCCTAC 
ITGHNSPLSKGWQDRGPSSY 

910 930 950 

TACAATGTGGAATATGCTGTGGGGTACTGGATACATAAGGGAATGCCATCAGAGAAGGTG 
YNVEYAVGYWIHKGMPSEKV 

970 990 1010 

GTCATGGGCATCCCCACATATGGGCACTCCnCACACTGGCCTCTGCAGAAACCACCGTG 
VMGIPTYGHSFTLASAETTV 

1030 1050 1070 

GGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGTCTTCAGGCTTCCTG 
GAPASGPGAA6PITESSGFL 

1090 1110 1130 

GCCTAnATGAGATCTGCCAGTTCCTGAAAGGAGCCAAGATCACGAGGCTCCAGGATCAG 
AYYEICQFLKGAKITRLQDQ 

1150 1170 1190 

CAGGnCCCTACGCAGTCAAGGGGAACCAGTGGGTGGGCTATGATGATGTGAAGAGTATG 
QVPYAVKGNQWVGYDDVKSM 

1210 1230 1250 

GAGACCAAGGnCAGnCTTAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTATT 
ETKVQFLKNLNL6GAMIWSI 

1270 1290 1310 

GACATGGATGACnCACTGGCAAATCCTGCAACCAGGGCCCTTACCCTCTTGTCCAAGCA 
DMDDFTGKSCNQGPYPLVQA 

1330 1350 1370 

GTCAAGAGAAGCCTTGGCTCCCTGTGAaggattaacttacagagaagcaggcaagatgac 
VKRSLGSL* 

1390 1410 1430 

cttgctgcctggggcctgctctctcccaggaattctcatgtgggattccccttgccaggc 

1450 1470 1490 

cggcctttggatctctcttccaagcctttcctgacttcctcttagatcatagattggacc 

1510 1530 1550 

tggttttgttttcctgcagctgatgacttgttgccctgaagtacaataaaaaaaattcat 

1570 1590 
tttgctccagtaaaaaaaaaaaaaaaaaactcgag 

FIG. IB 
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10 30 50 

gaattcggcacgagggctgtcgaaacctcagtggataaaagacctagagaatgtgtatcc 

70 90 110 

cagaagaagctggccaaggatATGGGAGCAACCACCATGGACCAGAAGTCTCTCTGGGCA 

MGATTMDOKSLWA 

130 150 170 

GGTGTAGTGGTCTTGCTGCTTCTCCAGGGAGAGATGGGGTTnGCTATGTTGCCAGAGCT 
r. \1 \1 M \ I I LOG E M G F C Y V A R A 

190 210 230 

GGTCTTGAACTCCTGGGCTCAAGAAGTCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATA 
GLELLGSR5PPASASQSAGI 

250 270 290 

ACAGGATCTGCCTACAAACTGGTTTGCTACTTTACCAACTGGTCCCAGGACCGGCAGGAA 
T6SAYKLVCYFTNWSQDRQE 

310 330 350 

CCAGGAAAATTCACCCCTGAGAATAnGACCCCTTCCTATGCTCTCATCTCATCTATTCA 
PGKFTPENIDPFLCSHLIYS 

370 390 410 

TTCGCCAGCATCGAAAACAACAAGGnATCATCAAGGACAAGAGTGAAGTGATGCTCTAC 
FASIENNKVIIKDKSEVMLY 

430 450 470 

CAGACCATCAACAGTCTCAAAACCAAGAATCCCAAACTGAAAAnCTCTTGTCCAnGGA 
QTINSLKTKNPKLKILLSIG 

490 510 530 

GGGTACCTGTnGGTTCCAAAGGGTTCCACCCTATGGTGGATTCTTCTACATCACGCTTG 
6YLF6SKGFHPMVDSSTSRL 

550 570 590 

GAATTCAnAACTCCATAATCCTGTnCTGAGGAACCATAACrrrGATGGACTGGATGTA 
EFINSIILFLRNHNFDGLDV 

610 630 650 

AGCTGGGAATACCCAGATCAGAAAGAAAACACTCATTTCACTGTGCTGAnCATGAGnA . 
SWEYPDQKENTHFTVLIHEL 

670 690 710 

GCAGAAGCCTTTCAGAAGGACnCACAAAATCCACCAAGGAAAGGCTTCTCnGACTGCG 
AEAFQKOFTKSTKERLLLTA 

730 750 770 

GGCGTATCTGCAGGGAGGCAAATGAnGATAACAGCTATCAAGnGAGAAACTGGCAAAA 
6VSAGRQMIDNSYQVEKLAK 

790 810 830 

GATCTGGATnCATCAACCTCCTGTCCTnGACTTCCATGGGTCnGGGAAAAGCCCCn 
DLDFINLLSFDFHGSWEKPL 



FIG.2A 

SUBSTITUTE SHEET (RULE 26) 
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850 870 890 

ATCACTGGCCACAACAGCCCTCTGA6CAAGGGGTGGCAGGACAGAGGGCCAAGCTCCTAC 
IT6HNSPLSKGWQDR6PSSY 

910 930 950 

TACAATGTGGAATATGCTGTGGGGTACTGGATACATAAGGGAATGCCATCAGAGAAGGTG 
YNVEYAV6YWIHKGMPSEKV 

970 990 1010 

GTCATGGGCATCCCCACATATGGGCACTCCTTCACACTGGCCTCTGCAGAAACCACCGTG 
VM6IPTYGHSFTLASAETTV 

1030 1050 1070 

GGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGTCTTCAGGCnCCTG 
GAPASGPGA AGPITESS6FL 

1090 1110 1130 

GCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCCAAGATCACGAGGCTCCAGGATCAG 
AYYEICQFLKGAKITRLQDQ 

1150 1170 1190 

CAGGnCCCTACGCAGTCAAGGGGAACCAGTGGGTGGGCTATGATGATGTGAAGAGTATG 
QVPYAVKGNQWVGYODVKSM 

1210 1230 1250 

GAGACCAAGGTTCAGTTCTTAAAGAATTTAAACCTGGGA6GAGCCATGATCTGGTCTATT 
ETKVQFLKNLNLGGAMIWSI 

1270 1290 1310 

GACATGGATGACnCACTGGCAAATCCTGCAACCAGGGCCCnACCCTCnGTCCAAGCA 
OMDDFTGKSCNQGPYPLVQA 

1330 1350 1370 

GTCAAGAGAAGCCnGGCTCCCTGTGAaggattaacttacagagaagcaggcaagatgac 
VKRSLGSL* 

1390 1410 1430 

cttgctgcctggggcctgctctctcccaggaattctcatgtgggattccccttgccaggc 

1450 1470 1490 

cggcctttggatctctcttccaagcctttcctgacttcctcttagatcatagattggacc 

1510 1530 1550 

tggttttgttttcctgcagctgatgacttgttgccctgaagtacaataaaaaaaattcat 

1570 1590 
tttgctccagtaaaaaaaaaaaaaaaaaactcgag 



FIG.2B 
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10 20 30 

HGATTHDQKSLWAGVVVLLLL QGEHGFCYVl hpnfwSl 
^j^lTypppiy^V 9P-39 

R$v4/ACr%Ll >IIP ch i totr i os i dose 

50 60 

hpnfw51 
9P-39 

chitotriosidase 




91 
55 
55 



1 YSFASIE NNKVI IKDKSEy MLYQT INSLK 
!YSF#|MHIDTyE^^ 
I^AGMTWOLSTTEyNDETlLYI 



9P-39 

chitotriosidase 



hpnfwSl 
9P-39 

chitotriosidase 



130 140 150 
121 iTKNPKLKILLSIGGYLFGSKGF HPHVDSStj hpnfwSl 
85 l#IP|iK|flLS|viGG|WiFGS|Qte 9p-39 

OKQTll^TAN chitotriosidase 




SRLEFINSIILFLRNHNFDaDVSyiYPDQl hpnfwSl 



R gp-39 

iS chitotriosidase 




190 



200 



181^— -|NTHFTV 

145 R DKCliFTff 

145 QGSPAVDKES"T|T 



L IHELAEAFQKDFTKSTJ hpnfwSl 
jpcMpQ^ gp-39 
QCEMFuQEAQTSG ch i totr i os i dase 



210 



220 



230 



206 
169 
175 



KERLLLTAGVSAGRQMl DNSYQVEKLAKDL 
l@.LL 
KERLLL 




hpnfwSl 
9P-39 

chitotriosidase 



240 



250 



260 



236 DF INLLSFDFHGSyEKPL ITGHNSPLSKGWJ hpnfwSl 

199 DFjpHTYlirHG&g^GlftPlI^ 

205 iDFlfolAY lDFHGSyEi^ -V lTGHNSPLl Yl^Q chitotriosidase 



270 



280 



290 



266 IQDRGPSSYYNVEYAVGYWIHKGMPSEKVVM hpnfwSl 
227 q^tpRFMYAVGYfCp^ 
233 EESBAAASljiDMSlOKQTBASKLlL chi 



totr i OS i dase 



300 



296 
257 
263 



GIPTYGHSFTLASi 
GIPTBGfeFTLA$-| 
( flPTYGNSFTLA S^S: 




320 



TTVGAPASGPGAAG 

GAP|fcGPGr 
GAPA 



hpnfwSl 
9P-39 

chitotriosidase 
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10 30 50 

GGAATTCGGCACGAGGAGCAACCACCATGGACCAGAAGTCTCTCTGGGCAGGTGTAGTGG 

M n Q K S L W A G V V \/ 

70 90 110 

TCnGCTGCnCTCCAGGGAGGATCTGCCTACAAACTGGTTTGCTACTTTACCAACTGGT 
I- 1- L L Q G G S AYKLVCYFTNWS 
130 150 170 

CCCAGGACCGGCAGGAACCAGGAAAATTCACCCCTGAGAATATTGACCCCTTCCTATGCT 
QDRQEPGKFTPENIDPFLCS 
190 210 230 

CTCATCTCATCTATTCATTCGCCAGCATCGAAAACAACAAGGTTATCATCAAGGACAAGA 
HLIYSFASIENNKVIIKDKS 
250 270 290 

GTGAAGTGATGCTCTACCAGACCATCAACAGTCTCAAAACCAAGAATCCCAAACTGAAAA 
EVMLYQTINSLKTKNPKLKI 
310 330 350 

TTCTCnGTCCATTGGAGGGTACCTGTTTGGTTCCAAAGGGTTCCACCCTATGGTGGAn 
LLSIGGYLFGSKGFHPMVDS 
370 390 410 

CTTCTACATCACGCnGGAATTCATTAACTCCATAATCCTGTTTCTGAGGAACCATAACT 
S TSRLEFINSIILFLRNHNF 
430 450 470 

TTGATGGACTGGATGTAAGCTGGATCTACCCAGATCAGAAAGAAAACACTCATTTCACTG 
D6LDVSWIYPDQKENTHFTV 
490 510 530 

TGCTGAnCATGAGTTAGCAGAAGCCnTCAGAAGGACnCACAAAATCCACCAAGGAAA 
LIHELAEAFQKDFTKSTKER 
550 570 590 

GGCnCTCTTGACTGCGGGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAG 
LLLTAGVSAGRQMIDNSYQV 
610 630 650 

nGAGAAACTGGCAAAAGATCTGGATTTCATCAACCTCCTGTCCnTGACTTCCATGGGT 
EKLAKDLDFINLLSFDFHGS 
670 690 710 

CTTGG6AAAAGCCCCTTATCACTGGCCACAACA6CCCTCTGAGCAAGGGGTGGCA6GACA 
WEKPLITGHNSPLSKGWQDR 
730 750 770 

GAGGGCCAAGCTCCTACTACAATGTGGAATATGCTGTGGGGTACTGGATACATAAGGGAA 
GPSSYYNVEYAVGYWIHKGM 
790 810 830 

TGCCATCAGAGAAGGTGGTCATGGGCATCCCCACATATGGGCACTCCnCACACTGGCCT 
PSEK-V VMGIPTYGHSFTLAS 



FIG.5A 
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850 870 890 

CTGCAGAAACCACCGTGGGGGCCCCTGCCTCTGGCCCTGGAGCTGCTG6ACCCATCACAG 
AETTVGAPASGPGAAGPITE 

910 930 950 

AGTCTTCAGGCTTCCTGGCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCCAAGATCA 
SSGFLAYYEICQFLKGAKIT 
970 990 1010 

CGAGGCTCCAGGATCAGCAGGTTCCCTACGCAGTCAAGGGGAACCAGTGGGTG6GCTATG 
RLQDQQVPYAVKGNQWVGYD 

1030 1050 1070 

ATGATGTGAAGAGTATGGAGACCAAGGTTCAGTTCnAAAGAATTTAAACCTGGGAGGAG 
DVKSMETKVQFLKNLNLGGA 

1090 1110 1130 

CCATGATCTGGTCTAnGACATGGATGACTTCACTGGCAAATCCTGCAACCAGGGCCCTT 
MIWSIDMDDFTGKSCNQGPY 

1150 1170 1190 

ACCCTCTTGTCCAAGCAGTCAAGAGAAGCCnGGCTCCCTGTGAAGGAnAACTTACAGA 
PLVQAVKRSLGSL* 

1210 .1230 1250 

GAAGCAGGCAAGATGACCTTGCTGCCTGGGGCCTGCTCTCTCCCAGGAATTCTCATGTGG 

1270 1290 1310 

GATTCCCCTTGCCAGGCCGGCCTTTGGATCTCTCTTCCAAGCCTTTCCTGACTTCCTCTT 

1330 1350 1370 

AGATCATAGAnGGACCTGGTTTTGTTTTCCTGCAGCTGATGACTTGTTGCCCTGAAGTA 

1390 1410 1430 

CAATAAAAAAAAnCATTTTGCTCCAAAAAAAAAAAAAAAAAAAAACTCGAG 
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1 . . . .GGAATTCGGCACGAGGAGCAACCACCATGGACCAGAAGTCTCTCT6 46 

I li llllllllllliillllllllllillMllii 
251 aagctggccaaggatATGGGAGCAACCACCATGGACCAGAAGTCTCTCTG 300 

47 GGCAGGTGTAGTGGTCTTGCTGCTTCTCCAGG6 79 

lllllllllllllllliiillllllllililll 
301 GGCAGGTGTAGTGGTCTTGCTGCTTCTCCAGGGAGAGATGGGGTTTTGCT 350 



351 ATGTTGCCAGAGCTGGTCTTGAACTCCTGGGCTCAAGAAGTCCTCCTGCC 400 

80 AGGATCTGCCTACAAACTGGTTTG 103 

llllllilllllllllllllMII 
401 TCAGCCTCCCAAAGTGCTGGGATAACAGGATCTGCCTACAAACTGGTTTG 450 

104 CTACTTTACCAACTGGTCCCAGGACCGGCAGGAACCAGGAAAATTCACCC 153 

lllllillllliilllllllllllllllilllMllllllllllllllli 

451 CTACTTTACCAACTGGTCCCAGGACCGGCAGGAACCAGGAAAATTCACCC 500 
154 CTGAGAATATTGACCCCTTCCTATGCTCTCATCTCATCTATTCAnCGCC 203 

Nllllllllllilllllllllllllllllllll|j||||ill!i|||ll 

501 CTGAGAATAnGACCCCTTCCTATGCTCTCATCTCATCTATTCAnCGCC 550 
204 AGCATCGAAAACAACAAGGnATCATCAAGGACAAGAGTGAAGTGATGCT 253 

IIMIIIIIIIIIIIIIIIIIIIili|j||||||j||||||||||||||| 

551 AGCATCGAAAACAACAAGGTTATCATCAAGGACAAGAGTGAAGTGATGCT 600 
254 CTACCAGACCATCAACAGTCTCAAAACCAAGAATCCCAAACTGAAAATTC 303 

IMIilMllilllllllllllllllilllllllllllMllilllllll 

601 CTACCAGACCATCAACA6TCTCAAAACCAAGAATCCCAAACTGAAAATTC 650 
304 TCTTGTCCAnGGAGGGTACCTGrrTGGnCCAAAGGGnCCACCCTATG 353 

lllilillMlillillllllMlllilllllllllllllllllllllll 

651 TCnGTCCAnGGAGGGTACCTGTTTGGnCCAAAGGGnCCACCCTATG 700 
354 GTGGAnCTTCTACATCACGCnGGAATTCAnAACTCCATAATCCTGTT 403 

MNIIilllilMllllllilllllllllllllllllllMIIIIIIII 

701 GTGGAnCTTCTACATCACGCTTGGAAnCATTAACTCCATAATCCTGTT 750 
404 TCTGAGGAACCATAACTnGATGGACTGGATGTAAGCTGGATCTACCCAG 453 

llllllllilllllillllllllllilllllllllMMIIMIilllK 

751 TCTGAGGAACCATAACTnGATGGACTGGATGTAAGCTGGATCTACCCAG 800 

, FIG.6A 
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454 


ATCAGAAA6AAAACACTCATTTCACTGTGCTGATTCAT6AGTTA6CAGAA 
illlllllllllllllllllllllllllllllllllllllllllllllll 

llllllllltilllliliiilllllllllitlllllllilliliilllil 

ATCAGAAAGAAAACACTCATTTCACTGTGCTGATTCATGAGnAGCAGAA 


503 


801 


850 


504 


GCCTTTCAGAAGGACTTCACAAAATCCACCAAGGAAAGGCnCTCTTGAC 


553 


851 


ilillllllMIIIIIMIIIMIMIIIIIIIIIIIIIIIIIIIIIMI 

6CCTTTCA6AAGGACTTCACAAAATCCACCAAGGAAAGGCTTCTCTTGAC 


900 


554 


TGCGGGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAGTTG 


603 


901 


illNliMNIIIIIINIIIIIIIIIIMIIIIIMIIMMIIMII 

iiiiiiiiiiiiiiiiiiiitiiiiiiiiiiiiiiiiiiiiiiiiiiii( 

TGCGGGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAGTTG 


950 


604 


AGAAACTGGCAAAAGATCTGGATTTCATCAACCTCCTGTCCTTTGACTTC 


653 


951 


illllillllllliilllMlilllllllllllMllillllllMIIII 
AGAAACTGGCAAAAGATCTGGAnTCATCAACCTCCTGTCCTTTGACTTC 


1000 


654 


CATGGGTCnGGGAAAAGCCCCTTATCACTGGCCACAACAGCCCTCTGAG 

illllllllllltlllllllllMliltllliilMlillilliillili 
CATGGGTCnGGGAAAAGCCCCTTATCACTGGCCACAACAGCCCTCTGAG 


703 


1001 


1050 


704 


CAA66GGTGGCAGGACAGAG6GCCAAGCTCCTACTACAATGTGGAATATG 


753 


1051 


lllillMIMMIIIIIIIIIIIIIllillllllllillMIMIIIIi 
CAAGGGGTGGCAGGACAGAGGGCCAAGCTCCTACTACAATGTGGAATATG 


1100 


754 


CTGTGG6GTACTGGATACATAAGGGAATGCCATCAGAGAAGGTG6TCATG 

inillllllltllMMIIIIilllllllllMIIIIIIIIIIIMIil 

CTGTGGGGTACTGGATACATAAGGGAATGCCATCAGAGAAGGTGGTCATG 


803 


1101 


1150 


804 


GGCATCCCCACATATGGGCACTCCnCACACTGGCCTCTGCAGAAACCAC 

lllllllii(MlllllllilillilIillliillllltlllllllllll 
GGCATCCCCACATATGGGCACTCCTTCACACT6GCCTCTGCAGAAACCAC 


853 


1151 


1200 


854 


CGTGGGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGT 


903 


1201 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIill 
CGTGGGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGA6T 


1250 


904 


CTTCAGGCTTCCTGGCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCC 

IIIMMMillllllllllilllliilllllllllllllllllillil) 
CTTCAGGCnCCTGGCCTAnATGAGATCTGCCAGTTCCTGAAAGGAGCC 


953 


1251 


1300 



AAGATCACGAGGCTCCAGGATCAGCAGGnCCCTACGCAGTCAAGGGGAA 1003 

lllllllllllilllllllllllllllllllllillllllllllllllll 
AAGATCACGAGGCTCCAGGATCAGCAGGTTCCCTACGCAGTCAAGGGGAA 1 350 

FIG.6B 
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1004 


CCAGTGG6TGGGCTATGATGATGTGAAGAGTATGGAGACCAAGGTTCAGT 
1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 i 1 1 1 1 1 1 


1053 


1351 


M M 1 M M 1 1 1 1 1 i 1 N 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M N 
CCAGTGGGTGGGCTAT6ATGATGTGAAGAGTATGGAGACCAAGGTTCAGT 


1400 


1054 


TCnAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTATTGACATG 


1103 


1401 


IIIIIIIIIIIMIIMIIIIIIIIIIjllllllllllMIIIIIIIIII 
TCTTAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTAnGACATG 


1450 


1104 


GATGACnCACTGGCAAATCCTGCAACCAGGGCCCTTACCCTCnGTCCA 

IIIIMfillllllllllllllllllllllllllllllllliilllllll 

GATGACTTCACTGGCAAATCCTGCAACCAGGGCCCTTACCCTCTTGTCCA 


1153 


1451 


1500 


1154 


AGCAGTCAAGAGAAGCCTTG6CTCCCTGTGAAGGATTAACTTACAGAGAA 

llllllllilllllMMIIIIIIIIiliillllilllllllllililM 
AGCAGTCAAGAGAAGCCTTGGCTCCCTGTGAaggattaacttacagagaa 


1203 


1501 


1550 


1204 


GCAGGCAAGATGACCTTGCTGCCTGGGGCCTGCTCTCTCCCAGGAAnCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 i 1 1 
U 1 1 1 1 i 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gcaggcaagatgaccttgctgcctggggcctgctctctcccaggaattct 


1253 


1551 


1600 


1254 


CATGTGGGAnCCCCTTGCCAGGCCGGCCTTTGGATCTCTCTTCCAAGCC 


1303 


1601 


llllllliilllillllllililMillllillillllllllllillill 
catgtgggattccccttgccaggccggcctttggatctctcttccaagcc 


1650 


1304 


TTTCCTGACTTCCTCTTAGATCATAGATTGGACCTGGnTTGTTTTCCTG 

1 1 i [ 1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 i i 1 1 1 1 1 1 M 1 1 i 1 1 1 1 
1 i 1 1 1 1 1 1 1 M 1 M 1 1 N M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 ^ 

tttcctgacttcctcttagatcatagattggacctggttttgttttcctg 


1353 


1651 


1700 


1354 


CAGCTGATGACTTGTTGCCCTGAAGTACAATAAAAAAAATTCATnTGCT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 


1403 


1701 


1 1 1 1 1 1 1 i 1 1 1 1 1 1 i i 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 ^ 
cagctgatgacttgttgccctgaagtacaataaaaaaaattcattttgct 


1750 


1404 


• CCAAAAAAAAAAAAAAAAAAAAACTCGAG 


1432 


1751 


IIIIIIIMMIIIIIIIIIIIIIIIIII 
ccagtaaaaaaaaaaaaaaaaaactcgag 


1779 
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1 MDQKSLWAGVVVLLLLQG 18 

IIIIIIIIIIIIIIMII 
1 MGATTMDQKSLWAGVVVLLLLQGEMGFCYVARAGLELLGSRSPPASASQS 50 

19 ....GSAYKLVCYFTNWSQDRQEPGKFTPENIDPFLCSHLIYSFASIENN 64 

illlllllllllilllllillllllllilMIIIIIIMMIIIII 
51 AGITGSAYKLVCYFTNWSQDRQEPGKFTPENIOPFLCSHLIYSFASIENN 100 

65 KVIIKDKSEVMLYQTINSLKTKNPKLKILLSIGGYLFGSKGFHPMVOSST 114 

lllllllilillllllllllllllllltllllllllllllilllllllll 
101 KVIIKDKSEVMLYQTINSLKTKNPKLKILLSIGGYLFGSKGFHPMVDSST 150 

115 SRLEFINSIILFLRNHNFDGLDVSWIYPOQKENTHFTVLIHELAEAFQKD 164 

MllllllllillllllllllilllilllilillllllillllMIIIII 
151 SRLEFINSIILFLRNHNFDGLDVSWIYPDQKENTHFTVLIHELAEAFQKD 200 

165 FTKSTKERLLLTAGVSAGRQMIDNSYQVEKLAKDLOFINLLSFDFHGSWE 214 

llllllllillllitllllllMlilllllliMllilllilllllilii 
201 FTKSTKERLLLTAGVSAGRQMIDNSYQVEKLAKDLDFINLLSFDFH6SWE 250 

215 KPLITGHNSPLSKGWQDRGPSSYYNVEYAVGYWIHKGMPSEKVVMGIPTY 264 

lllllllillllllllillDIMiililllllllllilllllllliili 
251 KPLITGHNSPLSKGWQDRGPSSYYNVEYAVGYWIHKGMPSEKVVMGIPTY 300 

265 GHSFTLASAETTVGAPASGPGAAGPITESSGFLAYYEICQFLKGAKITRL 314 

IIIIIIIIIIIIIIIIIMIIillllllllllllllllllllllllMII 
301 GHSFTLASAETTVGAPASGPGAAGPITESSGFLAYYEICQFLKGAKITRL 350 

315 QDQQVPYAVKGNQWVGYDDVKSMETKVQFLKNLNLGGAMIWSIDMDOFTG 364 

IIMIIIMIMIIIIIMIIIMIIIIilMIMIIIIIIIIIIMIII 

351 QDQQVPYAVKGNQWVGYDOVKSMETKVQFLKNLNLGGAMIWSIDMDDFTG 400 

365 K5CNQGPYPLVQAVKRSLGSL . 385 

lllllllllllllllllllll 
401 KSCNQGPYPLVQAVKRSLGSL* 422 



FIG.6D 
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A. The indications made below relate to ihc microorganism referred ui in the description 
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B. IDENTIFICATION OF DEPOSIT Further deposits arc idenilfied on an additional sh^ct Q 



Name of depositary institution 
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Address of depositary institution (inciuding postai codi: and country) 
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United States of America 



Dale of deposit 

May 3, 1996 


Accession Number 
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In respect of those designations in which a European Patent is sought a sample 
of the deposited microorganism will be made available until the publication of 
the mention of the grant of the European patent or until the date on which the 
application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issije of such a sample to an expert nominated by the person requesting the 
sanple (Rule 28(4) EPC) , 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (If tit^ indlcauons arc not for all designated States) 



E- SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 
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For rccciviog Office use only 



This sheet was received with (heSnternational application 
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For International Bureau use only 
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SINGAPORE 

The applicant hereby request that the furnishing of a sample of a microorganism shall only be made 
available to an expert. The request to this effect must be filed by the applicant with the International 
Bureau before the completion of the technical preparations for international publication of the 
application. 

NORWAY 

The applicant hereby request that, until the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be effected 
to an expert in die art. The request to this effect shall be filed by the applicant with the Norwegian 
Patent Office not later than at the time when the application is made available to the pbulic under 
Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been filed by tiie applicant, 
any request made by a third party for the furnishing of a sample shall indicate the expert to be used. 
That expert may be any person entered on a list of recognized experts drawn up by the Norwegian 
Patent office or any person approved by the applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only be 
effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the application, 
to a person who is a skilled addressee without an interest in the invention (Regulation 3.25(3) of the 
Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection (by 
the National Board of Patents and Registration), or has been finally decided upon by the National 
Board of Patents and Registration without having been laid open to public inspection, the furnishing 
of a sample shall only be effected to an expert in the art. 
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This intcmationaJ report has not been established in respect of certain claims under Article 1 7(2)(a) for the foUowing reasons: 
1. Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



□ 



Claims Nos.: 

because they relate to parts of the international application that do not comply wiih the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Claims Nos.: 

because they arc dependent claims and arc not drafted in accordance with the second and third sentences of Rule 6.4(t). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



^ * ^ required additional search fees were timely paid by the applicant, this international search report covers ail searchable 



claims. 



^' CD *^ searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. I ] As only some of the required additional -i;arch fees were timely paid by the applicant, this international search report covers 
only those claims far which fee:; were paid, specifically claims Nos.: 



^' IjL required additional search fees were timely paid by the applicant. Con.scqucnlly, this international search report is 



restricted to (he invention first mentioned in the claims; it is covered by claitns Nos.: 
1-15 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

n No protest accompanied the payment of additional search fees. 
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BOX n. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This iSA found multiple inventions as follows; 

Thb application contains the following inventions or groups of inventions which arc not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be examined, the appropriate additional 
cKamination fees must be paid. 

Group I, claims 1-15. drawn to DNA, vectors, host cells and expression of human chitinasc alpha and alpba-2. 
Group 11. claims 16, 17 and 19, drawn to human chitinasc alpha and alpha-2 protein and method of use. 
Group III, claim 18, drawn to human chitinasc alpha and alpha'2 antibody. 

Group iV. claim 20, drawn to a method of treating a tissue remodelling disorder with a '.uman chitinasc alpha or alpha- 
2 antagonist. 

The inventions listed as Groups I-IV do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2. they lack the same or corresponding special technical features for the following reasons: each of the 
four groups recites a chemically unrelated special technical feature. The special technical feature of Croup I is a 
nucleic acid encoding human chitinasc alpha or alpha-2, the special technical feature of Group 11 is a human chitinasc 
alpha or alpha-2 protein, the special technical feature of Group III is a human chitinasc alpha or alpha-2 antibody and 
the special technical feature of Group IV is a human chitinasc alpha or aIpha-2 anugonist. 
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